Gene Caul_1838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1838 
Symbol 
ID5899293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1950541 
End bp1953480 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content66% 
IMG OID641562328 
ProductTonB-dependent receptor 
Protein accessionYP_001683465 
Protein GI167645802 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.534381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00298176 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCATAT CCATTCATTC AAACGCCCTG CGACTGGCGC TTATCGGCGC CAGCTGCCTG 
ACCGGCCTCG CCGCCGCGCC CGCCTTCGCC CAACAGACGC CCGCGCCAGC CGCCAGCGCG
GATGCGGTCG AGGAAGTCGT GGTCACGGGC TTTCGCAAGA GCCTCGCGGA CGCCACCAAC
GCCAAGCGCG ACAGCATCGC CTTCACCGAC TCGGTGTTCG CCGAAGACAT CGGCAAGTTC
CCGGATCTGA ACATCGCCGA GTCGCTGAAC CGCATTCCCG GCATCCAGCT GACCCGCGAA
ATCAACGGCG ACGGCCTGAA CATCGCCATC CGCGGCCTGG GCACCGACTT CACCAAGATC
GTGCTGAACG GCGCCCAGAT CGGTGTGGCC TCCAGCGGCC GGACCGACGC CCAGAACCAG
AACCGGCAGG TCGACCTCGA CCTGTTCCCG ACCGAACTGT TCACCCGCCT CGACGTCAGC
AAGACGCCAA TGCCCAGCCA GCTCGAAGGC GGCGTCGCGG GCATCGTCAA CATGCGCAGC
TCGCGACCGC TGGACCGGCC CGGCCAACAC CTCACCTACT CGCTGCAAGG CGCCTATCAG
GACTCCGCCG GCAAGTGGAG CCCGCGCGGC GCCCTGATCG GCAGCAAGTC GTGGGATGTC
GGCGACGGCG AGTTCGGCCT GCTGGTCGGC TACGCCGGCG CGCGCTCCAA GAGCCGCACG
GACGGGTTCG AGACCATCGG CTGGACCAAC GCCAGCACGG GCGGCTCGAG CAACTTCGCC
TGCGGCGGCT GCAACTCGAC CTTCGGCGGC AACGGCTTCA CCTGGGCGCC GACCGTTCCG
GCCAACGCCG GCAACGGCCT GACCACGGGG GCGACCGTCA ATGACGCCTT CCTGCAGGCC
AACAACCCCG GCACGACCCT GCAGCAACTG AGCGACGGCC TGCTGCCGCG CCTGGGCCGC
CAGTCCTACA GCGCCGGTCA TCGCGACCGC GACTCGCTGC TGGTCTCGCT GCAGTACCAG
CCCAACGACC ACGCCGACTT CTACATCGAC ACCCTGCTGG GCAAGACCAA CCGCGAGTTC
AGCCGGATCG ACATGGACTG GGTGGTCCGC AATTCGAACT TCATGGTGCC GACCAACGTC
AAGGTCGGCG CCAACAACGT GATCACCAGC GGTACGTTCG CCAACTCGCA GTTCTTCCTC
GAGGCGCGCC CCTATCACGA GACCAACAAG TTCGTGAACG TGAACCCGGG CGGCAGCTGG
CGCTTCAGCG ACACCCTCAA GCTGGACGGC CAGTTCAACT ACAGCCGCAG CGTCTTCTTC
CGCGAGGCGC CGACCATCCT GATCAACACG CCGCTCAACA GCGGCCTGAC GGTCACCTAC
GACAACACCG GCGGCGATTT CCCCAGCATC AAGACCAGCG CCAACCTCAA CGACCCGAGC
CTCGGCTGGA CCTTCGTCGG CGGCCGGGTG AACATCCAGA ACGAGAAGCG CGTCACCTCG
ACCAAGGGCA CGCACTGGGA CCTGACCTGG GGCGACGAGC GCAACTACAT CAAGGGCGGC
GTGGCCTATG ACGAGGCCTC GCGCTCCATC ATGGCGCTCG ACAATAGCGA CCGCTGGCAG
CAGATCACCT GCGGCGGCGG CGGGACCTAT CTGCCGCGTC CCAACACCCA ACCCGCCTGC
ACCGGCGGCG CGGGCTCGGC GATCACCAAC GCCCAGTTGG CCTCGTACCT GAAGCCCGGT
CCGCTCGGCT TCATCACCGT CGACTACGAC AAGTTCAAGG CCGCGACCAA CTATCAGGCC
CTGAACGACA CCGCGCCGTT CAGCAGCTCG GCCGCCACGG CCGCCAACTC TGGCGAGATC
GAGGAAAAGA ACACCGGCGC CTATATCGAG TTCGCCGGCG TCGCCACCAT CATGGACCGC
GAACTGCGGA TCGCCGGCGG CTCACGCTAC GTCTCGACCG ACCAGGACGT CACCGGCCCG
GTCTCGATCC CGTTCCCGAA CGTCGCCAAC TGCACGCCCA ACTGCGTGCC CAACACCCTG
ACGTTCAAGA CGACCTCGCA GCGGTATGAC GCCTTCCTGC CGTCGTTCAA CGCCGTCTAC
GCCGTGCGCG ACAACATCAA TCTGCGGATG TCCGCGTCGC GCACCCTCAC CCGTCCGGAC
CCCAGCGCCA TGCTGCCGGG CACCACGTTC AGCGACCCGT CCGCCCAGAA CGCCAACCAA
GGCAACCCGG CGCTGCGCCC ATACACCTCC AACAACTTCG ACGTGGGCGG CGAGTGGTAT
ACCGGCGGCG CGGGCTATGT TGGCGTGGCC CTGTTCCAGA AGGTGGTCAC GGGCTTCACC
GCCGTGGGCG CCACCACCCA GCCGTTCACC GCGCTGGGCA TCCCGTTCGA CAGCCTGACC
GATCTGCAGA AGACCGCCAT CAATAACCGT GGCGGCCCCA GCGCCGCGAC GGTGACGGTC
AGCCAACAGG TCAACACCGG CTCGGACCTG ACCATCCGCG GCTACGAGCT GAACTGGGTT
CAGCCCCTGG ACTTCGTGTT GCAAGGCGCC GGCTTCACGG CCAACTACAC CCGCGTCAAC
CAGACCGGCA CCGGCGGCGT CGTGGCGCTG GGCGTCTCGC CCTACACCTA CAACCTGACG
GGCTACTACG AGAACCACGG CGTGACGCTG CGGGTGTCCT ACAACTACAA CGACGCCCAG
ATCAGCTCGG GCTTGAACCA GAACAGCGTG CCGACGGCGC GGATCAAGAC CGACGCCTAC
AAGCAGATGG ACCTCTCGGC CAGCTATACC CTGCCCATCC TGGGCGGCGC GCAGATCACC
TTCAACGCCA TCAACATCAC CAGCGAAACC CAGCGGCAGA CCTTCCAATA CCCGAACGCC
GCCTACACCT TCTACGATCC AGGTCCGACC TATCTGATCG GCATCCGCGG TCAGTTCTAG
 
Protein sequence
MAISIHSNAL RLALIGASCL TGLAAAPAFA QQTPAPAASA DAVEEVVVTG FRKSLADATN 
AKRDSIAFTD SVFAEDIGKF PDLNIAESLN RIPGIQLTRE INGDGLNIAI RGLGTDFTKI
VLNGAQIGVA SSGRTDAQNQ NRQVDLDLFP TELFTRLDVS KTPMPSQLEG GVAGIVNMRS
SRPLDRPGQH LTYSLQGAYQ DSAGKWSPRG ALIGSKSWDV GDGEFGLLVG YAGARSKSRT
DGFETIGWTN ASTGGSSNFA CGGCNSTFGG NGFTWAPTVP ANAGNGLTTG ATVNDAFLQA
NNPGTTLQQL SDGLLPRLGR QSYSAGHRDR DSLLVSLQYQ PNDHADFYID TLLGKTNREF
SRIDMDWVVR NSNFMVPTNV KVGANNVITS GTFANSQFFL EARPYHETNK FVNVNPGGSW
RFSDTLKLDG QFNYSRSVFF REAPTILINT PLNSGLTVTY DNTGGDFPSI KTSANLNDPS
LGWTFVGGRV NIQNEKRVTS TKGTHWDLTW GDERNYIKGG VAYDEASRSI MALDNSDRWQ
QITCGGGGTY LPRPNTQPAC TGGAGSAITN AQLASYLKPG PLGFITVDYD KFKAATNYQA
LNDTAPFSSS AATAANSGEI EEKNTGAYIE FAGVATIMDR ELRIAGGSRY VSTDQDVTGP
VSIPFPNVAN CTPNCVPNTL TFKTTSQRYD AFLPSFNAVY AVRDNINLRM SASRTLTRPD
PSAMLPGTTF SDPSAQNANQ GNPALRPYTS NNFDVGGEWY TGGAGYVGVA LFQKVVTGFT
AVGATTQPFT ALGIPFDSLT DLQKTAINNR GGPSAATVTV SQQVNTGSDL TIRGYELNWV
QPLDFVLQGA GFTANYTRVN QTGTGGVVAL GVSPYTYNLT GYYENHGVTL RVSYNYNDAQ
ISSGLNQNSV PTARIKTDAY KQMDLSASYT LPILGGAQIT FNAINITSET QRQTFQYPNA
AYTFYDPGPT YLIGIRGQF