Gene Caul_3715 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3715 
Symbol 
ID5901171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4010073 
End bp4012268 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content65% 
IMG OID641564226 
ProductTonB-dependent receptor 
Protein accessionYP_001685340 
Protein GI167647677 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.62728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCT CCAACAAGCT GCGCAGCCGC CTCCACCGCG GCGCCGCGCT CACCGTCCTG 
GCCCTCGCCG TCGGCGTCGC CGTCCCGGCC TTCGCCCAGG ATACGGGCAA CGTCACCCTG
GATGACGTGA TCGTCACCGC CCAGAAGCGG TCGGAAAACG TCCAGGAAAT CCCCGTTTCG
GTCGCGACCA TGTCCGGTGA AAAGCTGGGC GACGTGCTGG CCGCCGGCGA GGACATCGTG
GGTCTGTCGA GCCGCGTGCC TGGCCTGTAC ATCGAATCCT CGAATGGCCG CGCCGCGCCG
CGCTTCTACA TCCGGGGCCT CGGCAACGTC GACTTCGACC TGGCCGCCTC GCAGCCCGTT
TCAGTGATCA TCGACGACGT GGTCATGGAA AACGTGGTGC TCAAGAGCAC CCCGATCTTC
GACGTCCAGC AGGTCGAAAT CCTGCGCGGT CCGCAAGGCA CGCTGTTTGG CCGCAACACC
ACCGCCGGCA TCGTCAAGTT CGACTCGGTC AAGCCGAGCC AGGACTTCTC GGCAACCGGC
ACGGCCACCT ACGGCACCTA TGGCACCGCC ACGTTCGACG GCGGAGTCGG CGGCGCGTTG
GTCAAGGACG TCCTGTCCGG CCGCCTGTCG GTGTTGGCCC AGCACCGCAA CGACTACATC
GACAATGGCT TCAATGGCGA AAAAGACGCC TTGGGCGGCT ATGACGAATA CGCCATCCGC
GGCCAGTTGC TCTATACGCC GACCGACAAG TTCAGCGCCC TGCTGAACCT GCACAACCGC
TCGCTGGACG GCACCGCCGC GATCTTCCGC GCCAACATCC TCACCACCGG CAGCAACAAG
ATCAACGGCA ACTTCAAACG CGACAAGGTC TTCTACAACG GCGGCAACAA CAACCCGCAG
AAGTTCGACG GCAACGGCGC GTCGCTGAAG ATGGACTACG ATCTGGGCGG CGCCAAGCTG
ACCTCGATCT CGGCCTACGA GACCACCAAC GGCTATAGCC GCGGCGACAT CGACGGCGGC
GTGGCCGGGG TCGGCCCGGG CTTCATCCCG TTCGACTCCG CCTCGGCGGA CGCCATCGAC
CTGGACCAGT ACACGCAGGA AATCCGCCTC GCCAGCGACG ACGCTTCGCC GCTGACCTGG
CAGGTGGGCG CCTACTACTT CAAGTCCAAG TTCTCGGTGG CCAGCGATCC CGGCTTCGCG
CCGCCCTCGA CCATCGAGCA CAAGAACACG GCTTGGGCGG TGTTCGGCCA AGCCTCTTAC
AAGATCTCTG ACGACCTGAA GATCACCGGT GGCCTGCGCT ACACCAGCGA CGACAAGGAC
ATGTCGGTGT TGAGCTCGCC GTTCGGCATC CCCGCGCCGG TCTCGGTGTC GGACGAAAAG
GTCAGCTGGG ACCTGTCGGC CTTTTACGAC GTCGCGCCCG ACGTCAGCCT CTATGCCAAG
GTCGCCTCCG GCTTCCGCGG TCCGTCGATC CAGGGCCGCG ACATCGCCTT CGGCAGCGCC
TCCTCGATCG CCCAGTCGGA AACGATCATG TCGTACGAGG CGGGCCTGAA GAGCGAGCTG
CTCGACCGCC GCGTGCGCCT GAACGGCGCG GTGTTCGCCT ATGAGGTCAA GGACCTCCAG
CTCAGCGCCA TCGGCGGCGG CAGCAACTCC AACCGCCTGA TCAACGCCGA CAAGGGCCAA
GCCTACGGCT TCGAACTCGA CGGCGAATGG GCGGTCAACG AGAACCTGCT GGTGACGGCC
GGCTACAGCT ACAACCACAC TGAAATCAAG GACAGCGGGC TGACCACCGC CGCCTGCGGT
TCGGGTCAGT GCACGGTGAC CGACCCGACC ACCACCGGCG GCCTGGCCCT GATCAACGGC
AACCCGTTCC CGAACGCGCC GAAGTCGATC CTGACCTTCA CGGCCCGCTA CAGCTATCCG
ATCGGCGACG GCGAACTGTT CGCCTACACC GACTGGTTCC GTCAGGGCTA CACCAACATC
TTCCTGTACG AGAGCAAGGA GTACCACACC AACGGCGACT TCGAAGGCGG CCTGAAGCTG
GGCTACGCCA AGTCCGACGG TGCGTATGAA GTGGCGCTGT TCGCCCGCAA CATCACCAAC
GAGGTGAACC TGCGCGGCGG CATCGATTTC GACAACAACA CCGGCTTCGT CAACGAGCCC
CGGATCGTCG GCATCTCGAT CAGCGCCAAG CGGTAA
 
Protein sequence
MTTSNKLRSR LHRGAALTVL ALAVGVAVPA FAQDTGNVTL DDVIVTAQKR SENVQEIPVS 
VATMSGEKLG DVLAAGEDIV GLSSRVPGLY IESSNGRAAP RFYIRGLGNV DFDLAASQPV
SVIIDDVVME NVVLKSTPIF DVQQVEILRG PQGTLFGRNT TAGIVKFDSV KPSQDFSATG
TATYGTYGTA TFDGGVGGAL VKDVLSGRLS VLAQHRNDYI DNGFNGEKDA LGGYDEYAIR
GQLLYTPTDK FSALLNLHNR SLDGTAAIFR ANILTTGSNK INGNFKRDKV FYNGGNNNPQ
KFDGNGASLK MDYDLGGAKL TSISAYETTN GYSRGDIDGG VAGVGPGFIP FDSASADAID
LDQYTQEIRL ASDDASPLTW QVGAYYFKSK FSVASDPGFA PPSTIEHKNT AWAVFGQASY
KISDDLKITG GLRYTSDDKD MSVLSSPFGI PAPVSVSDEK VSWDLSAFYD VAPDVSLYAK
VASGFRGPSI QGRDIAFGSA SSIAQSETIM SYEAGLKSEL LDRRVRLNGA VFAYEVKDLQ
LSAIGGGSNS NRLINADKGQ AYGFELDGEW AVNENLLVTA GYSYNHTEIK DSGLTTAACG
SGQCTVTDPT TTGGLALING NPFPNAPKSI LTFTARYSYP IGDGELFAYT DWFRQGYTNI
FLYESKEYHT NGDFEGGLKL GYAKSDGAYE VALFARNITN EVNLRGGIDF DNNTGFVNEP
RIVGISISAK R