Gene Caul_1889 details


Gene Information

Locus tag: Caul_1889
Symbol:
ID: 5899344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2026183
End bp: 2028636
Gene Length: 2454 bp
Protein Length: 817 aa
Translation table: 11
GC content: 65%
IMG OID: 641562379
Product: TonB-dependent receptor
Protein accession: YP_001683516
Protein GI: 167645853
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
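
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the terminal stop codon is counted in the gene length; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2026183, 2028636)
print(length)                  # 2454
print(protein_length(length))  # 817
```

Both values match the Gene Length (2454 bp) and Protein Length (817 aa) fields of this record.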


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCATCA CAAAGACGGG CGTGTCGTGC GCGGCTCTGG CCTGGGCGCT GGCGTCGTGG 
TTGTCCCCGG CTGCGTTCGC GCAGACCCAA ACGCAGCCCG CCCCGGAGGC CGCGACTGCG
GTCGACGACG TCGTCGTCAC CGCCCGTCGC CGTGACGAGG CGCCGATTTC GGTGCCGGTG
TCCGTGACCG TTCTGGGTGC GGCCCAACTC AAGATGCTCG CGGTGGACAG CCTGCAGGAT
TTGCGCGCGG TGACCCCCGG CATTTCCGTG GGCGAAGTGT CCGGCGGCGT GGGCGGCACG
GTGGCGCTGC GCGGCGTCGG CACGACGGCC GGATCAAATC CGACGTTCGA GCAGACGGTG
GCGATCAATG TCGACGGCGT CCAACTGTCG CGCGGCGGCG CGGTGCGCGT CGGTCAGATC
GATATGGAGA AGATCGAGGT TCTTCGTGGG CCGCAGGCGC TGTTCTTCGG CAAGAACAGC
CCGGCCGGAG TGATCTCGAT CACCACGGCC GACCCGACCT CGACGTTCGA ATCCGAGGTG
CGCGGCGGCT ATGAGTTCAA CGCCAACGAA CGCCAGCTCG AGGCGACAAT CTCCGGTCCG
CTGACGACCA CGCTTGGCGC TCGCCTGGTT GTCTATGGTT CTGACATGGA CGGTTGGCGC
GACAACACCG CCGACGCGGC GGCGGCGGCG GCAAATGCGA TCCGGCCCGG CTCGGTGACC
GGCTCGACCA GGAGCGGTCC GCAGCAAAAG TTCTTCTTCA CGCGCGGAAC CCTGAAGTGG
CGACCCAACG ATGCGTTCGA CGCCCGGCTC AAGCTCAGTT ACGCCGACAA CAAGGGCATC
GGCTACAATC AGGCCGGCGG GTTCCAAAGG ATCTACTGTC CCAGCGGGGC GCCCCAGCTG
GCGCCGCAGT CGACGGCCCT GAACGGCGGC GCGGCCAATC CGGCCCTGGC GGCGGCGCTG
TCGGTCGACA ATTGCCGCGC CGACGCGACC TACGCCAACG GCAACATCAA CCCGGCGTTC
CTGGTCGGCT CGCCAGAAGG GTTCACCGAT CCGGACGGGG CCGGGAACTA CAGCCAGCAA
CTCTATTCGC TGGAAATGAA CTATCGGCCG AACGACTGGC TCGCGCTGAC TTCGGTCACC
AGCTGGGCCA AGAACGAGGA TTTCCGGGCC GACACCTACG CCGTCGCGCC CTCCGACGCT
GTCGCCGCGA ACGATTTTAC CGGCAATACC GGCTACACTC AGTTCACCCA GGAGGTGCGG
CTCGCCTCGA AGCTTTCCGG ACCGGTCAAT TTCCTGGTCG GGGCGTTCTA CGAAGACTCC
GACCTGGAGA CTTACACCCG CAACATTCTG GCTGGCGCGC CGCTCTTCCT GCACAACATA
GACGGCACGA CGCAGTCGGT GTTCGGCCAG GGGATCTGGG ACATCACCGG CAAGCTCGAA
CTTGCCGGCG GTTTACGCTG GAGCAAGGAA GCCAAGGATT TCGCGGTCAG TCGAAACGGC
GCGCCGCAGC CGGTGTCGCC TGACTCCGCC GATTTCAAGA ACACTTCGCC TGAGGTGACG
CTGACATGGC GGCCGACCGA GCGCCTGACC CTGTACGGCG CTTACAAGCA AGGCTTCAAG
TCGGGCGGTT TCGCGGCGGC CACCAACACG GGCGCCGCCT TCACCACGCC GCTCAATGCT
CTGTACCTGC CTGAGTCAGC CGAAGGATTC GAGGGCGGCT TGAAAGCGCT ATTGTTCGAC
GGCGCTCTTC GCCTGAACAC GGCGGCCTAT GACTACGACT ACACGAACCT GCAGGTGAAC
TCCCTCGACA ACTCGAGCGG CGTGCCCGTG ATCCGCGTGA ACAATGCCGG CGCGGCGACC
GTGAAGGGCG TCGAGGGCGA CTTCACGCTG AAGCTGGCCG GGGCGCCCGG CCTGACGATC
CGCGGCGCGG CCAACTACAA CGACGCCAAG TATGCCAATT TCCTGGCGAC CTGTTACATC
GGCCAGACCG TCGCGGATGG TTGCAACCTG CTTTTGAACC CGCTTACCGG ACGGTATACG
GGCCAGCAAC TGGCGGGCCA CCGGCTCGTG AACGCCCCAG AATGGACCGG TTCGTTGGGC
GGAGCCTACA CGGGCAAGAC CTTCATCGAA GGGATCGACT GGGGCATGAA CGTCGATGGG
CTCTACAAAT CGCGATACAA TCCGCACCCG GAACTGCACC CCGGCGCCCA GCAGGACGGG
GTCATCTTCC TGAACGCCGA CGTTCGCGTG TTCCGCGATG ACCATCGCTG GGAGCTTGCC
CTGATTGGGC GCAATCTGAC CGAGGAATAT CGGGTCGACG TGGCGTCCAA CGTGCCCCAG
ACCGGCGTGG CGACGCGCAC CGGATCGGCC CTGACGGGCG GCCTCGCGGA CCTGAGCGGC
AATGTCAATC GAGGTCGCGA GGTCATGCTG CAGCTTACGT TCCGACCGTT CTGA
 
Protein sequence
MIITKTGVSC AALAWALASW LSPAAFAQTQ TQPAPEAATA VDDVVVTARR RDEAPISVPV 
SVTVLGAAQL KMLAVDSLQD LRAVTPGISV GEVSGGVGGT VALRGVGTTA GSNPTFEQTV
AINVDGVQLS RGGAVRVGQI DMEKIEVLRG PQALFFGKNS PAGVISITTA DPTSTFESEV
RGGYEFNANE RQLEATISGP LTTTLGARLV VYGSDMDGWR DNTADAAAAA ANAIRPGSVT
GSTRSGPQQK FFFTRGTLKW RPNDAFDARL KLSYADNKGI GYNQAGGFQR IYCPSGAPQL
APQSTALNGG AANPALAAAL SVDNCRADAT YANGNINPAF LVGSPEGFTD PDGAGNYSQQ
LYSLEMNYRP NDWLALTSVT SWAKNEDFRA DTYAVAPSDA VAANDFTGNT GYTQFTQEVR
LASKLSGPVN FLVGAFYEDS DLETYTRNIL AGAPLFLHNI DGTTQSVFGQ GIWDITGKLE
LAGGLRWSKE AKDFAVSRNG APQPVSPDSA DFKNTSPEVT LTWRPTERLT LYGAYKQGFK
SGGFAAATNT GAAFTTPLNA LYLPESAEGF EGGLKALLFD GALRLNTAAY DYDYTNLQVN
SLDNSSGVPV IRVNNAGAAT VKGVEGDFTL KLAGAPGLTI RGAANYNDAK YANFLATCYI
GQTVADGCNL LLNPLTGRYT GQQLAGHRLV NAPEWTGSLG GAYTGKTFIE GIDWGMNVDG
LYKSRYNPHP ELHPGAQQDG VIFLNADVRV FRDDHRWELA LIGRNLTEEY RVDVASNVPQ
TGVATRTGSA LTGGLADLSG NVNRGREVML QLTFRPF
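
The sequences above are displayed in blocks of ten characters separated by spaces. The following is a minimal sketch of stripping that layout and running basic sanity checks on a gene sequence. The helper names are hypothetical, and the CDS check only accepts ATG as a start codon, which is a simplification: translation table 11 also permits alternative start codons, although this gene does begin with ATG.

```python
# Stop codons used by translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq starts with ATG, has length divisible by 3, and ends in a stop codon."""
    return (
        seq.startswith("ATG")
        and len(seq) % 3 == 0
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGATCATCA CAAAGACGGG")
print(first_blocks)       # ATGATCATCACAAAGACGGG
print(len(first_blocks))  # 20
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would allow a comparison with the GC content field reported for this gene.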