Gene Caul_5456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5456 
Symbol 
ID5897090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp169980 
End bp171623 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content66% 
IMG OID641550743 
Productintegrase catalytic region 
Protein accessionYP_001672229 
Protein GI167621721 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0520655 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGATG ACCCGGATAT GCGTCCAACT GGAGGCGCTG AAGAACGCTG GCGCAGGGCG 
CTTGAGCGCC AACCAGAGCT ACGAAAGTTG GTTGAGGCTC CCAGCCGAAG CCGAGCGGAC
GTGGAAACCG CCGCCACCCG ACTTGGCTTG CATATCTCGA CGCTTTACCG GCTGCTGCGC
CGCTTTGAGG TCGATCGGAC TGCGGAGGCG ATCACGGGTA GAGCGCGGGG ATGGTGCCCG
GGACGCAGCC GCGTCCCGGC CTTGATCGAG ACCGTGATCG ACGCGGCGAT CCAAGACTTC
TTCCTTACCA AGCAGGCTCC GTCCGCCGCA GCGTTGCATC GAGAGATCGA GTTGCGCTGC
CGGGCTACTG GATTGGCAAC ACCTGCAATC TCGACCGTCC ATCGGCGGCT GCGCAAGCTC
GGCCGTAAGA CGGTCGCGGG GCGCCGCGAG GGGAGAGGTG CAGCCGAGGC TCAGACTATG
CGGCCCGGCG CCTTGCAGAT CGACCGACCC AATATGCTCT GGCAGATCGA CCACTCCCCA
GCCGATGTCG TGATTGTGGA CGTCGAGACA CGGGCGCCGA TCGGACGGCC GTGGGTGACG
CTAGTGATCG ACGTGGCCTC GCGTGTCATC GCCGGCCTGC ATGTCTCACT CGAAGATCCG
TCGGTGATCT CAGTCGGACT GGCCTTGCGC CACGCCATCC TGGATAAGGC CGACGCCCTG
AGGGAACGGG AGGTGGCTGC GGACTGGCCA GCCTTCGGTT TGCCGGATGG CGTCCACAGC
GACAACGGCT CCGATTTTCG AAGCGCGACG TTCCAGCGGG CCTGCGCCAA CCTCGGTATC
GAGATCGACT ATCGGCCCCT GGGCGCGCCG CGCTATGGCG GTCATATTGA GCGCCTGATT
GGGACCGCGC AGCAGGAGAT GCACCTGCTG CCGGGTACAA CCTTCTCCAA CGTCTCACAG
CGCGGTGACT ATGACAGCGA CGGGTCCGCC GCGCTCACAC TGGACGAGTT TGAGACCTGG
CTTTGGCGGT TCATCGCTAG CGACTACAAT ATGCGGATCC ACTCCGTCAC GGGTCGGCCG
CCGCTAGTGG CCTGGCGCTG CGGCGTCGAT GGCCAGGGCT TTGCTCCGCG GAGGCCGAGC
GATCCGGAGC GTCTAGCCTT CGAATTCCTG CCTTCGGTAC CGCGCGCCAT CACCCGCCAG
GGCGTCGTCT TCAACCGCAT ACATTACTAT GAGCCGTTCC TGGAGCCGCT GTTCGACACT
GGGGATCGTA GGCTGCTGGT CCGCTACGAC CCGCGCGATC TGTCCCGTCT CTACCTGGCC
ACGGCGCAGG GTGTTCAATC CATTCGCTAC CGCAACCTCG CGCGCCCACC CATGAGCTTA
TGGGAACTCC GGGCGGCCAG ACGACGCCTC GCTGCCGAGG GCGCGGCCCA TGTTAACGAA
GACGCGCTGT TCGAGGCGCG ACGACGCAAT GTCGAGTTGG TCGGACGGGC GAAGAGCGAG
ACCCGCCGTC AGCGCCGTGA CGCCGAACGC CGCGACCGCG GCTATGCGGC GGCGTTGGCT
CCTGACCCGC CACAGCCCGA TCCGGAGCCG GTCGCGTCGC TGGGTCCCAT CAGCGGTCGA
GTCGGGGAGA TTGAGCAGTG GTGA
 
Protein sequence
MVDDPDMRPT GGAEERWRRA LERQPELRKL VEAPSRSRAD VETAATRLGL HISTLYRLLR 
RFEVDRTAEA ITGRARGWCP GRSRVPALIE TVIDAAIQDF FLTKQAPSAA ALHREIELRC
RATGLATPAI STVHRRLRKL GRKTVAGRRE GRGAAEAQTM RPGALQIDRP NMLWQIDHSP
ADVVIVDVET RAPIGRPWVT LVIDVASRVI AGLHVSLEDP SVISVGLALR HAILDKADAL
REREVAADWP AFGLPDGVHS DNGSDFRSAT FQRACANLGI EIDYRPLGAP RYGGHIERLI
GTAQQEMHLL PGTTFSNVSQ RGDYDSDGSA ALTLDEFETW LWRFIASDYN MRIHSVTGRP
PLVAWRCGVD GQGFAPRRPS DPERLAFEFL PSVPRAITRQ GVVFNRIHYY EPFLEPLFDT
GDRRLLVRYD PRDLSRLYLA TAQGVQSIRY RNLARPPMSL WELRAARRRL AAEGAAHVNE
DALFEARRRN VELVGRAKSE TRRQRRDAER RDRGYAAALA PDPPQPDPEP VASLGPISGR
VGEIEQW