Gene Caul_5396 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5396 
Symbol 
ID5897195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp106916 
End bp109432 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content64% 
IMG OID641550686 
ProductTonB-dependent receptor 
Protein accessionYP_001672172 
Protein GI167621664 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTTA CCCAGGTGCT TTGCGGCACG GCGTCCGCAG CCGTGCTCGC CGGTCTGTCC 
GTCGCGAGTG TCGCGCAGGC CCAACAAACC GAAACGATGA CCGTCGACAG CATCGTCGTC
ACCGCCCAGA AGCGCGAGCA GAACCTGCAG GACGTGCCGG TCGTGGTCAC CGCCGTCGGC
GCCAAGCTGC TGCAAGACAC CGGCGTCAAG GACATCAAGG ATCTGACGAT CCTGACCCCC
GGTCTGACCG TCACCTCGAC CACGTCGGAA GCCTCGACCA CCGCCCGCGT GCGCGGCGTC
GGCACGGTCG GCGACAACCC CGGTCTGGAA AGCTCGGTCG GTGTCGTGAT CGACGGCGTC
TATCGTCCCC GCAACGGCGT CTCGTTCGGC GACCTGGGCG AAATGGACCG CATCGAAGTC
CTGAAGGGTC CGCAAGGCAC TCTGTTTGGC AAGAACACCT CGGCCGGCGT CATCAACATC
GTGACCAAGG AGCCCGAGTT CGGCTTCGGC GCCGCCGCTG AAGCCACCCT CGGCAACTTC
AACGCCCACG GCCTGTCGGC CTCGGTGACC GGTCCGCTGT TTGGCACCGA GACCCTGGCC
GGCCGCCTCT ATGTTGCCGC CCGAGAGCGC GACGGCTACA ATGATGTTCT GACCGGCAAG
GGTCCCAGCA AGCGCGACCA GGACCAGGAC CAGGGCTTCT ACACGATCCG TGGCCAGCTC
CTGTTCGTGC CCAATGATGA AGCCACCTTC AAGCTTATCG GCGACTACAC CAAGCGCGAC
GAGAACTGCT GCGGCGCCGT GCAGATCCGC ACCGGCCCGA CCGCGCCGAT CCTGAACGCC
CTGGCCGGCG GCGTCGCCTT GGCCCCGACC GCCAAGCCCT ATGATCGCGT CGCCTATTCC
AACCGTGGCG CGCCCTCGTC CATCGAGGAC AAGGGCATCT CGCTGGAAGG CAATATCGAA
CTGCCGATCG GCGAACTGAC TTCGATCACC GCCATCCGCA ACTGGCGCAC CGACAACGGT
CAGGACAGCG ATTTCTCGAC GGCTGACCTT GCCTATCGCA ACAAGGACGG CACCAACTAC
AGCGAGTTCA CGACCTACAC CCAGGAACTG CGCCTGGCTG GCAAGACCGG CAATCTGGAT
TGGCTGGTCG GCACGTTCCT GGCGGACGAG CGACTGGATA ACAAGGCCAA CTTCCTCTAT
GGCAATGACT ACGAGCAGTA TCTGGGCCGC CTGCTGTCGC GCTCGGCGGC CAACCCTGCC
GGCATCGGTA ACTTCATCTC GCTGCTGACC GGTCGCGCGC CGAACACCAC CTTCACGCCA
GGCCAGGGCC TGTACGACAA GTACAATCAG CGCGCCAAGA CCATCGCGCT GTTCACCAGC
AACACCTACC AGTTCACCGA CGCCTTCAGC ATCACCGCCG GTCTGCGCTA CAGCATCGAC
AAGAAGGACT TGGACACCTA CCAGACCACG TCCGATGGCG GCGTCGGCTG CGGCATCGGC
CTGTCCACCG CTGGTCAGGC CCGCATCGCC GGCATCGTGG GCGCAGCGGC CACCCCGACC
GTGGTCGGCA ACCTCTGCCT TCCCTGGGCC AACCCGCTGT TCAACGGTCG CGGCACCCAC
CAGAAGCGCA CCGACAAGGA ATGGTCGGGC ACCATCAAGG CCGCCTATCG CTTCTCGCCG
GAAGTGTTCA GCTACGCCTC TTATGCGCGC GGCTACAAGG GTGGCGGTTT CAACCTGGAC
CGCACCCAGT CGTCGAACGG CCTGCAAAGC GGCGGTCTGG GCGTCGTGCC GATCTACGAC
ACCTCGTTCG AGGGCGAGTT TGTCGACAGC TACGAAATCG GCCTCAAGAA CACCCTGTTC
AACCGCTCGG TCCTGCTGAA CCTGACGGGC TTCTATCAGA AGTTCACCGA CTTCCAGCTG
AACACCTTCC TGGGCACCTC GTTCGCCGTG GCTTCGATCC CGGAAGTAAC CTCGCAGGGC
GTCGACGCCG ACTTCCTGTG GTTCACCCCG GTGCGCGGCC TGACCGTCCA GAGCGGCTTC
ACCTACGCCA AGACCGAATA CGGCAACCAG AAGATCCCCA ACGATTCGAC CAACGCCCTG
GCCCTGCTGC CTGGTCAACG TCTGTCGCTG GCGCCGGAAT ATTCGGCCTC GGGTTCGCTG
ACCTATGAGC GTCCGGTGGG CGACAGCTAC AAGGCCCGCT TCAATGTCGG CGCCAAGTAC
TCGTCGGAAT ACAACAGCGG TTCCGACCTG TTCCCGCCGA AGTTGCAAGA GTCCTACACG
GTCGTGAACG CCCGCATCGG CGTCGGCACT GCGGATGACG CCTGGACCGT CGAACTCTGG
GGCCAAAACC TGTTCGACGA GGAATATACC CAGGTGGGCT TCAACGCCTT CCTGCAAGGC
TCGTCGGGGC TCAGTGCCAC CCAGGCAACT TATGTGCCGG CCAATGACAC GATTACCTAT
GACGCCTTCC TCGGCGCGCC GCGCACCTAC GGCGTGACGC TGAGGGCTAA GTTCTAA
 
Protein sequence
MRFTQVLCGT ASAAVLAGLS VASVAQAQQT ETMTVDSIVV TAQKREQNLQ DVPVVVTAVG 
AKLLQDTGVK DIKDLTILTP GLTVTSTTSE ASTTARVRGV GTVGDNPGLE SSVGVVIDGV
YRPRNGVSFG DLGEMDRIEV LKGPQGTLFG KNTSAGVINI VTKEPEFGFG AAAEATLGNF
NAHGLSASVT GPLFGTETLA GRLYVAARER DGYNDVLTGK GPSKRDQDQD QGFYTIRGQL
LFVPNDEATF KLIGDYTKRD ENCCGAVQIR TGPTAPILNA LAGGVALAPT AKPYDRVAYS
NRGAPSSIED KGISLEGNIE LPIGELTSIT AIRNWRTDNG QDSDFSTADL AYRNKDGTNY
SEFTTYTQEL RLAGKTGNLD WLVGTFLADE RLDNKANFLY GNDYEQYLGR LLSRSAANPA
GIGNFISLLT GRAPNTTFTP GQGLYDKYNQ RAKTIALFTS NTYQFTDAFS ITAGLRYSID
KKDLDTYQTT SDGGVGCGIG LSTAGQARIA GIVGAAATPT VVGNLCLPWA NPLFNGRGTH
QKRTDKEWSG TIKAAYRFSP EVFSYASYAR GYKGGGFNLD RTQSSNGLQS GGLGVVPIYD
TSFEGEFVDS YEIGLKNTLF NRSVLLNLTG FYQKFTDFQL NTFLGTSFAV ASIPEVTSQG
VDADFLWFTP VRGLTVQSGF TYAKTEYGNQ KIPNDSTNAL ALLPGQRLSL APEYSASGSL
TYERPVGDSY KARFNVGAKY SSEYNSGSDL FPPKLQESYT VVNARIGVGT ADDAWTVELW
GQNLFDEEYT QVGFNAFLQG SSGLSATQAT YVPANDTITY DAFLGAPRTY GVTLRAKF