Gene Caul_4082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4082 
Symbol 
ID5901544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4424137 
End bp4426851 
Gene Length2715 bp 
Protein Length904 aa 
Translation table11 
GC content66% 
IMG OID641564602 
ProductTonB-dependent receptor 
Protein accessionYP_001685704 
Protein GI167648041 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTCTC ACTACAAGCT ACTCGCCGCG GTCAGTGGAC TGGCCCTGAT GGCGGCCGCC 
GGCGCCCACG CGCAAACGGC CGCGCCCGCC AACGACGCCG CGCAGGTCGA CGAAGTCGTC
GTCACCGGCG TCCGGAAGAG CCTGCGCGAC GCCCTGCAAG TGAAGCAGGG CTCGGACAAG
GTGGTCGAGG CCATCTCGGC CAAGGACATC GGCGTGCTGC CCGACGTCAC CATCGCCGAA
TCCATCGCCC GCCTGCCCGG CGTCAACGCC ACCCGCGACC GCGGCAACGA CAGCCAGGCC
GTCGTGCGCG GCCTGGGCGC GCGCCTGGTG CTGGGCACCA TCAACGGCCG CGAAGTCGCC
TCGTCAGAGC CCGACCGCAA CGTGCGCTGG GAAATCTACC CTTCCGAAGT CGTCCAGGGC
GTCCAGGTCT ACAAGTCGCA GTCGGCCGAC CTGATCGCCG GTGGCGTGGC CGCGACCATC
AACATCGACA CCATCGCCCC GCTCGACTAT CGCGGTCCCA GCGTCGTGCT GCGCGCCGGC
CCAGTCTATT ATGACGGCGG CAAGGACATC CCCAACTACG GCCAGACCGG CTACCGGGCC
AGCGGCTCGT TCGTGCACAA GTTCAACGAC GACCTGGCCA TCGTGCTGGG CCTGACCAGC
CAGAAGCAGA AGAACGGCTA CACCTCGTTC CAGGGCTGGG GCTACAACGA CTCGGTGATG
CGCCAGCCCG CTGGGGCTAG CGACTACAGC GGCGACCTGA ACGGCGACGG CAAGGTCGAT
CCCACCCCGT GGGGCATGCA GCTGGAGATC AAGAAGATCG ACCAGAAGCG CAACGGCGTG
TCGACCGGCC TGCAATGGAA GCCGACCGAC CACTTCGAGC TGAAGGCCGA CGTCCTCTAT
TCCGACATCA AGATCACGGA AAACCAGGAC CAGCAGATCT ACGCCCAGAA CTACGGCAAC
TGGAACAACG GCAATGCCTT CGACTGGCAG GGTAATCCCA TCGGCTACAA CGCCCCGGGC
GCGTCCTACA CCCTGGTCAA CGGCGACGTG GTCGCCGCCA CCCTGCCCGG CGCCGCCGTC
ACCTCGGTGA TCGCGCGCTA TACCGAGGAC AAGAAGCTCT ATGCCGGCGG CCTGAACGGC
AAGTGGACCA ACGACGCCTG GACCGTGGCC GGCGATGTCT CCTATTCGAA GGCCGAGCGG
ACCAACAACT GGCGGGCTGT GCGCGCCGAG GTCTATCCGG CCTGGATGAC CTATGACACC
CGCGCCGGCG TCAAGCCCAG CGTCACCACC TCGGAAGACC CAACCACCCT CGCCCAGGTG
GCCCCCAGCT GGCGCGGCGG CCAGAACGAC GGGCTTGAGC ACCTGAACGA TGAGTTGAAG
GCCGGCGCCC TGGACTTCAC CCGCGACTTC GGCGGCGGGG CGTTCAAGAG CTTCCAGTTC
GGCGCGCGCT ATTCAGACCG GGTGAAGGAT CACGACCAGG CCAGTTGGTC CACCTGCCCC
AACCCGACCA ACACGGCCAA CCTGGCCGGC CTGAAGGACC AGTGGGGCAA CTGCCTCTAC
TCGGTCACCC TGCCGGCCAG CCTGTTCAGC ACCTACAAGA TCGGCAGCTT CAACGTGCCG
AGCATTTTGA CCGGCGACCT GGACGCCATC GCCAAGGCCG CCTACGGCGA CCACGGCTTC
GACGCCGCCA ACGCTGTCGA CAACCTGGCC CAGCGCTGGC GCGTCCACGA GAAGGTGGCC
GAGGCCTATG GCAAGCTGAA CTTCGCCGCC GACGGCGTCG CCGGCGCCTG GATGACCGGC
AATGTCGGCG TCCGCGTGGT CAGCACCAAG ACCGACAGCG AGGGCTATCG CCAGGACCCG
GGCCTGGCGA CCTTCTCGGC CGTCTCGGTC AAGGCTGACT ATACCGACGT GCTGCCCAGC
GCGAACGTCA AGCTGGACTT CGACCAGGGC CGCGTGCTGC GCTTCGGCCT GGCCCAGGTG
GTGGCCCGCC CGCCGCTCGA CGAGCTGCGC GCCAGCCGCA CCCTGACCAC CTGGTCGCCC
TATACCGGCT CGGCCGGCAA CCCGAACCTC AAGCCGTTCA AGGCCATCCA GTTCGACGCC
TCGGCCGAGT GGTACTTCCG TCCCGAGAGC CTGGTGGCCG CGTCCTACTA CTACAAGGAC
GTCGATACCT ATATCGGCTG GAAGCAGACG CCCGAGACCT ACAACGGGAT CACCTACGCG
GTGTCGAGCC CGGTCAATGG CGGCGGCGGC TACATCCAGG GCCTGGAGCT GACCTTCCAG
ACGCCGTTCT TCTTCCTGCC GGGACCGCTG AGCAAGTTCG GGATCTATTC GAACTACGCC
TATGTCGACT CCGACCTGAA GGAGTTCCAG CCGGTCACCA AGCCGCTGTC CCTGACGGGC
CTGGCCAAGG ACACCGTGAC CCTGGACCTG TGGTACGCCA ACGGCCCGAT CGAAGGCCGC
ATCGGCTACA AGTACCACAG TCCGATGACC GTGATTTACG GCTGGAGCGG CGCGGACCTG
CAGACCCTGG AGTCGGCAAG CACGGTCGAC TTCAGCTCGT CCTACCAGGT CACCGACAAG
ATCGGCCTGC GCTTCCAGGT CAACAACCTG ACCAATGAGC GCCTGCGGAT GTATCGCGAC
AACAAGCCCG ACCGCCTGGG TCGCTACGAC CTTTACGGCC GCCGCTTCCT GTTCGACGTG
ACGGTGAAGT TCTAG
 
Protein sequence
MMSHYKLLAA VSGLALMAAA GAHAQTAAPA NDAAQVDEVV VTGVRKSLRD ALQVKQGSDK 
VVEAISAKDI GVLPDVTIAE SIARLPGVNA TRDRGNDSQA VVRGLGARLV LGTINGREVA
SSEPDRNVRW EIYPSEVVQG VQVYKSQSAD LIAGGVAATI NIDTIAPLDY RGPSVVLRAG
PVYYDGGKDI PNYGQTGYRA SGSFVHKFND DLAIVLGLTS QKQKNGYTSF QGWGYNDSVM
RQPAGASDYS GDLNGDGKVD PTPWGMQLEI KKIDQKRNGV STGLQWKPTD HFELKADVLY
SDIKITENQD QQIYAQNYGN WNNGNAFDWQ GNPIGYNAPG ASYTLVNGDV VAATLPGAAV
TSVIARYTED KKLYAGGLNG KWTNDAWTVA GDVSYSKAER TNNWRAVRAE VYPAWMTYDT
RAGVKPSVTT SEDPTTLAQV APSWRGGQND GLEHLNDELK AGALDFTRDF GGGAFKSFQF
GARYSDRVKD HDQASWSTCP NPTNTANLAG LKDQWGNCLY SVTLPASLFS TYKIGSFNVP
SILTGDLDAI AKAAYGDHGF DAANAVDNLA QRWRVHEKVA EAYGKLNFAA DGVAGAWMTG
NVGVRVVSTK TDSEGYRQDP GLATFSAVSV KADYTDVLPS ANVKLDFDQG RVLRFGLAQV
VARPPLDELR ASRTLTTWSP YTGSAGNPNL KPFKAIQFDA SAEWYFRPES LVAASYYYKD
VDTYIGWKQT PETYNGITYA VSSPVNGGGG YIQGLELTFQ TPFFFLPGPL SKFGIYSNYA
YVDSDLKEFQ PVTKPLSLTG LAKDTVTLDL WYANGPIEGR IGYKYHSPMT VIYGWSGADL
QTLESASTVD FSSSYQVTDK IGLRFQVNNL TNERLRMYRD NKPDRLGRYD LYGRRFLFDV
TVKF