Gene Caul_1908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1908 
Symbol 
ID5899363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2047294 
End bp2049561 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content64% 
IMG OID641562398 
ProductTonB-dependent receptor plug 
Protein accessionYP_001683535 
Protein GI167645872 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGTT TCAGGTGGAA AGCCGCGTTC TGTGCGGGCA CGGCGATGGT CTGCCTTCCC 
GCGATCGTAA GCGCCCAAAC CTCGCCTCAG GCCGAGCCGG TCTCGCCCAC GGGGCTCGAA
GAGGTCATCG TAACCGCCCA GCGTCGTGAA GAACGGCTAC AGGACGTACC GGTCGCGGTG
ACCGCCTTTG GTGCGCGGGA ACTGGAACGC CGCCAGATCA ACGACGTCCG CGGCCTGACC
GAGAACGCGC CGTCGATCAC CTTCACCGCG ACCCCTTATG GCAACAATGA CCTGATCCTG
GCGATCCGCG GCGTGGCGCC GGGAGGCGTC CTTCCCAATG TCGATCAGGC GGTCGGCACC
TATGTCGACG GTATCTATTA CGCGCGCCCC GAGGGGAGCA ACCTGGCCAT GGTGGACATC
GCCTCGGCCG AGGTCCTGCG CGGCCCGCAA GGGACCCTGT TTGGTCGCAA CACTATCGGC
GGCGCGCTGA ACATCACCAC CAACAAGCCG ACCTATGCCC TGGATGGATC GCTGAAGCTG
GGATACGGCA ACTACAACGC CATCACCGCG ACCGGCATCG TCAATCTGCC CATCGTCGCC
GACAAGCTGG CGGCGCGTCT AGTCTACAGC CACGTCGGAC ACGACGGCTA TGGCTACAAC
CCCTCCCTGG GGAGCGATGT CGCCGATCAG AACGACGATT TCATCCGCGC CAGCGTCCGC
GCCGACCTGT CGCCCGATCT GCGGGTGGAT GTCAGTTTCG ACCACTATGC GGGCTCCAAC
CACCAGCCTC TCTGGGTGCT AAACGCCTAT CAGGCCGGGC TGACGGCCGC GCAATACGCG
CCCTATGTCG CGGGCCCGAA ATCGCGCGTT AGCTACGCGG GTTATAATCC AACCAACAAG
TCCAAGGTCT TCGACTGGAC CGGGACGATC ACGGCAGACC TCGGATTTGC GACCCTGAAA
ACGATATCGG GCTATCGCAA CATCGATTTC GAGGGCGCCG CCGACCTGGA CGGGACGCCT
TTACCAACGG CGGACGTGCG CGCGTTCGAA CTCGACGGCG ACCAAGTCTC CCAGGAAATC
CAGCTCTACG GAACCGCCTT GGCCAATCGG CTGAACTGGA CCGCGGGCGC CTATTATTTC
CGCGAGAAAC TCCGCAACTC GCCAATCACC CGGGTGCCGG CCACGATCCA GGACAACACC
ATCCGGCCGA CCAATGAGTC GGTCAGCGTG TTCGGCCAGG TCAGTTTCGA GGTCTTGCCT
CGCCTGCGCC TCACGGCCGG CGTGCGGGCG GTCAAGGACA CCCGCGGCAT GGTCTATACG
CCGGCGCGCT TCGCCGTGGC GGCGGCCAGT TCGTCCAATC CCGAGCCGCC GGCCTCGGCG
GTGACGCCGG CGGCGTGCCC CTTCACGGCC ATCGGCCTCA ACCAGGCCCC GGGCGGGTGT
CTCTATGCGC CGGACGACAT CAAATTCAAC ACCGTCCCGT TCACGGTGGG CCTCGACTAT
CGATTGTCGG GAGACGGGCT GCTGTTCGCC AAGTTCTCCA AGGGCTATCG ATCGGGGGGA
TTCCAGCAAG CCAGCGGTAC GACCGCCGCC TTCTTCACGC CCTTCGACGA GGAAAGCGTG
GGAAGCTACG AGGCCGGCGC CAAGCTCTCG TTCTTCGAGA GGCGTCTGAG GGTCGGTCTG
AGCGGCTATT TCTCGACCTT TTCGGGGATC CAGCAGAACG CCATCCTTTC AGCCAGTCCT
ATCGTCATCG CCGTCCTCAA TTCGGGCAAG GCGGAGATCT ACGGCGGCGA GTTCGAGGCC
ACCGCCCTGC TGGGGGCTCT GCGCCTCAAC GCCGCGATCG GTCTGATCCA CCCGGAGTTC
ACCGAAGGAC CCTATAAGGG GTCGGAGGTG CCGACGGTCG CCAAGACCAC CTTTTCGCTC
AGCGCCGACT ATCCGGTCGA GCTGTCAGCC GGGCGGCTGG ACCTGCATGT CGACTACAAT
TACCGCTCCG AGGTGTTCTT CCTGAACACG GTGACGATCA CCGGGGCCGG GCCTGTTCCC
TTGACGGCCT TCCAGCGCCG CTCGGTCAGC CAGGATGGCT ACGGGTTGTT GAACGCCCAG
GCCTCGTTCA CCTTGGCGAA ATCTCCCGTC ACCGTGTCCG TCTACGGCAA GAACCTCGCC
AACGAATATT TCGGCGCGCG GTCGGGATCC TTCGCCGCGG CCAACTTCAA CACGATGGTG
ATCGGCGCGC CGCGCACCTA TGGGGTCAAC GTCTCCTACG CCTTCTGA
 
Protein sequence
MTSFRWKAAF CAGTAMVCLP AIVSAQTSPQ AEPVSPTGLE EVIVTAQRRE ERLQDVPVAV 
TAFGARELER RQINDVRGLT ENAPSITFTA TPYGNNDLIL AIRGVAPGGV LPNVDQAVGT
YVDGIYYARP EGSNLAMVDI ASAEVLRGPQ GTLFGRNTIG GALNITTNKP TYALDGSLKL
GYGNYNAITA TGIVNLPIVA DKLAARLVYS HVGHDGYGYN PSLGSDVADQ NDDFIRASVR
ADLSPDLRVD VSFDHYAGSN HQPLWVLNAY QAGLTAAQYA PYVAGPKSRV SYAGYNPTNK
SKVFDWTGTI TADLGFATLK TISGYRNIDF EGAADLDGTP LPTADVRAFE LDGDQVSQEI
QLYGTALANR LNWTAGAYYF REKLRNSPIT RVPATIQDNT IRPTNESVSV FGQVSFEVLP
RLRLTAGVRA VKDTRGMVYT PARFAVAAAS SSNPEPPASA VTPAACPFTA IGLNQAPGGC
LYAPDDIKFN TVPFTVGLDY RLSGDGLLFA KFSKGYRSGG FQQASGTTAA FFTPFDEESV
GSYEAGAKLS FFERRLRVGL SGYFSTFSGI QQNAILSASP IVIAVLNSGK AEIYGGEFEA
TALLGALRLN AAIGLIHPEF TEGPYKGSEV PTVAKTTFSL SADYPVELSA GRLDLHVDYN
YRSEVFFLNT VTITGAGPVP LTAFQRRSVS QDGYGLLNAQ ASFTLAKSPV TVSVYGKNLA
NEYFGARSGS FAAANFNTMV IGAPRTYGVN VSYAF