Gene Caul_4698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4698 
Symbol 
ID5902160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5081267 
End bp5083753 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content65% 
IMG OID641565217 
ProductTonB-dependent receptor 
Protein accessionYP_001686316 
Protein GI167648653 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.648479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACGA AAAAGCTGGC CTGTCTGGCG TCGACCGCTC TGGTCGGCTG CCTGCTGAGC 
GCCACCGCCG CCATGGCCCA ATCGACCGGC TCGCAAGCCA ACGAAGTCGA TAGCGTGGTT
GTAACCGCCG CCGGAGCCCG CGCCGTCGCC GGCCAGATCG TAGAGACGTT GCCGAAGTCG
CGCGCCTCGG TCGACGCCGC GTTCCTGGCG ACGCAATCCA CCGGCCAGAA CGTGTTCCAG
TCGCTGAACC TGTTGCCGGG CGTCAGCTTC ACCAACAACG ACCCCTACGG TTCGTCGGGC
GGCAACCTGC GCCTGCGTGG CTTCGACGGC GCTCGCGTCT CGGTCACGTT CGATGGCGTT
CCGCTGAACG ACACCGGTAA CTACGCCGTC TATCCGAACC AACAGCTGGA CGCCGAACTG
ATCGACCGCG CCAGCGTGAA TCTGGGCACG ACCGACGTCG ACAGCCCGAC CGCGTCGGCC
ACCGGCGGCA CCATCAACTA CATTACCCGC AAGCCGGCCC ATGAGTTCGG CGGCATCGCC
GACGCCTCGA TCGGCGAAGA TAACTATCGC CGCGGCTTCC TGATGGTAGA CACGGGCGAT
ATCGGTCCGT TCGGTACCCG CGCCTTCGTG GCCGGCTCCT ACCAGAAGTA CGACAAGTGG
AAGGGTCCGG GCTCGCTCGA GAAAAAGCAA GTCAACGCCC GGATCATGCA GGACATCGGC
AATCGCGGCG ACTTCGTCAG CCTCGCGGTG AACTACAATG AGAACCGCAA CAACAACATC
CGTCAGCTGT CGCTGTCCGA CTTCCGCACC TTCGGCAAGA ACTACGACTA CGACGCGGTC
TGTAACCCGG CCGCCGTCAG TGGCGTCGTC GTTCCCGGCA ACTGCACCAA CTACTACGGC
CGTCAGGTCA ACCCGTCGAA CACCGGCAGC ATCCGCGGCT CGGCCCTGTT CCATCTGGCC
GACAACATCC GTCTGACGAT CGACCCGTCG TTCCAATACA CCCTGGCCGA CGGCGGCTCG
CAGCTGGCCA CCGTCGCTGA AACAGATGGC CGGGTCCGTG GGACCGTCGC CGGCGCGCAA
GGCCCGGCTA AGGACCTGAA CGGCGACGGC GACGCCGTGG ACACCGTGGC GTTCTTCGCG
CCCAGCGTCA CCAACACCCG CCGCTACACC GTCACCAGCT CGCTGATCTG GGACCTGAAT
GACGACAACC GCGTCCGCGT CGCCTACACC GGCGACTACG GCCGTCACCG CCAGACCGGC
GAGTACACGA CGCTCGACTC GCAAGGCAAC ACGACCGACG TGTTCGGCGG CAAGGAAGGC
CACGGTGCCA AGGTCCTGAC CTCGGACGGC AGCTTCTTGC GCGCTCGTGA CCGTTTCTCG
ATCGCCCAGC TGAACCAGGT TGCCGCCGAA TACCGCGGCA AGTTCATGGA CAGCCGCCTG
ACCGTGAACC TCGGCATCCG CGCGCCCTTC TTCAAGCGCG AGCTGAACCA GTACTGCTAC
ACGCAGAACG CCACCGGCCT GACGACTGTG GTTTCAGGCT TCACGGTGCT CTGCACCACC
CAGACCCCGG TCGTCACCAA CGCCGACGGC ACGGTGCAGT TCGCCCCCAA CGGCACGACC
ACCGGCGCCG CCCTCGCCAA CCTGCGCTAC ATCAAGCCGT GGTCGGCCAC CGTGAAGTAC
GACAAGGTCC TGCCGAACGC CGGCGCGACC TATGACATCG GCGGCGGCAG CACCGTCTAT
GTCAGCTACG CCGAGGGCTT CTCCTCGCCG CGCACCGACA ACCTCTACAC TGCAACCCTG
GAGAACAAGG TCGGCACTCC GGCCGACACC CGTCCGGAAA CCACCAAGAC CTACGACCTG
GGCTATCGCT TCGCGAGCCC GACGGTCATG GCCACGGCCG CGGTCTGGAA GACGGATTAC
AAGAACCGCA TCGTGCAAGC CTATGATCCG GATCTGAACA TCAGCATCGA CCGCAACGTC
GGCGCGGTTA AGGCCTACGG CCTCGACACC CAGGCCGCCT GGGCCGTGGC CGAATACCTG
ACCGTCACGG GTTCGTTCTC GTACAACAAG AGCGAAATCC AGCAAGATCT GCAGGTCAAC
GCCGCCGGCG CCACCATCCC GCTGTCGGGC AAGCAGGTCG TCGAAACCCC GAAGTACACC
TTCGGCGGCC GCGTCGACTG GGACGTCACC GAAGCGCTGC ACCTGGGCGT CCAGGGCAAG
TACACCGGCG ACCGTTTCTC GACGGACGTG AACGACGAAG TGGCTCCGCA CTACACCGTG
TGGGACATGT CGCTGGAGTA CGACCTGCCG TTCGCCAAGA AGACCTACGC CCAGCTGAAC
GTGAACAACC TGTTCAACGA AACCTACTTC GGTTCGATCA GCTCGCGTAC GAACGCCTTG
GCGCTTACGG GGTCGTCGGC AAGCGCGCCA ACCTACTACA TCGGCTCGCC GCGCACGGTT
CAGTTCACCC TGCGCACCGA GTTCTAG
 
Protein sequence
MMTKKLACLA STALVGCLLS ATAAMAQSTG SQANEVDSVV VTAAGARAVA GQIVETLPKS 
RASVDAAFLA TQSTGQNVFQ SLNLLPGVSF TNNDPYGSSG GNLRLRGFDG ARVSVTFDGV
PLNDTGNYAV YPNQQLDAEL IDRASVNLGT TDVDSPTASA TGGTINYITR KPAHEFGGIA
DASIGEDNYR RGFLMVDTGD IGPFGTRAFV AGSYQKYDKW KGPGSLEKKQ VNARIMQDIG
NRGDFVSLAV NYNENRNNNI RQLSLSDFRT FGKNYDYDAV CNPAAVSGVV VPGNCTNYYG
RQVNPSNTGS IRGSALFHLA DNIRLTIDPS FQYTLADGGS QLATVAETDG RVRGTVAGAQ
GPAKDLNGDG DAVDTVAFFA PSVTNTRRYT VTSSLIWDLN DDNRVRVAYT GDYGRHRQTG
EYTTLDSQGN TTDVFGGKEG HGAKVLTSDG SFLRARDRFS IAQLNQVAAE YRGKFMDSRL
TVNLGIRAPF FKRELNQYCY TQNATGLTTV VSGFTVLCTT QTPVVTNADG TVQFAPNGTT
TGAALANLRY IKPWSATVKY DKVLPNAGAT YDIGGGSTVY VSYAEGFSSP RTDNLYTATL
ENKVGTPADT RPETTKTYDL GYRFASPTVM ATAAVWKTDY KNRIVQAYDP DLNISIDRNV
GAVKAYGLDT QAAWAVAEYL TVTGSFSYNK SEIQQDLQVN AAGATIPLSG KQVVETPKYT
FGGRVDWDVT EALHLGVQGK YTGDRFSTDV NDEVAPHYTV WDMSLEYDLP FAKKTYAQLN
VNNLFNETYF GSISSRTNAL ALTGSSASAP TYYIGSPRTV QFTLRTEF