Gene Caul_0504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0504 
Symbol 
ID5897959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp547691 
End bp550024 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content65% 
IMG OID641560987 
ProductTonB-dependent receptor 
Protein accessionYP_001682136 
Protein GI167644473 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGG CATTGTGGCT AGCGGCTTCG GCCATGCTGG TGTGTCCCTA TGTGGCGGGG 
ACGGCCCTCG CGCAGGAGGC GCCGCCGGTC AAGGCGGCCG AACCTGAAAG CGCCAGGCTG
GACGAGGTGG TGGTCACCGC GCAGAAGCGT GCCGAGAACG TCCAGACCGT GCCGATCTCG
ATCGCGGCCT TCAGCGCCAC CCAACTGACC CAGGCGGGCG TCGATGACGT GCTCGAGATC
ACCAAGATCG TGCCGAGCTT CACCTCGACC CGTCAGGCGA ACGTTTCCTC CGTTCGACTC
AATATTCGCG GCATCGGGGC TTCCGCCCAG ACGGCCGTGG AGCCTTCGGT CGCCAGCTTC
CTCGACGATG TCTATGTGTC GCGGCCGGGC GCGGTCGTTG GCCGTTTCTA CGACGTCGAA
TCGGTCGAGG TCCTGAGGGG GCCCCAGGGC ACGCTGTTTG GCCGCAACGC CAGCGCCGGC
GCGCTCAGCA TCCACACCAA GAAGCCCACC GACCGGTTTG GCGGCGATCT TTCCGTCCAG
GCGGCGTCGT TCGGCAGCTA TGAGGCCTCA GGCGCCGTCA ATATTCCGAT CGGCGACCGC
GCTGCGGTTC GTATCGCCGG TGTGGGCGCT TCGACGGACG GGCCTTGGCA CACCGACATC
GGCGATCACG ACTACGGCGC CCTCGACACC ATCGGGGGGC GCGCGACGCT GCGCCTGAAG
CCGACCGACC AGATCGACTG GATCCTCCGC GCGGACTATC TGCACCTGAC CGGCGACGCC
GGATCCCACA ACACCGTCAA GAGCGATACG GTCACGCCCG CCGCGCGCGC CAACTATCTG
GCCCGCCTGG GGATCACGCC CTATTTCGAT GACCAGTTCA GCCGCACCAG CAACAATTTC
ATGGTCGGCG ACCTCGATGA CCACCAGTAC GGTCTGACCA GCGACCTGAC CTGGGACGTC
GCCGACGGCT ATTCGATCCG GCTGATCGAC GCCATGCGCG ACTGGTCGTC CGACCAGGTT
GCGGGCGACC TTGCCTTTTC GCCGCGGCCG CTTCTGCTGC GCAACGAAAT ACAGGCCTCA
TACAGTTCCA GCCACGAGCT CCAGTTCATC TCTCCTGGCG ACCAGCTTCT CGGGGGGCGG
CTGTCGTTTG TCTCCGGCCT TTACTATTTC GACGAGACGC TGACGATCGA CGAGAAGGTC
ACCTTCACCG CCGATACCTG CAACTACATC ATCCGCCTGG CGGCCCCCGC TCTGCAGGCC
GCCTGTCTGG CGAGCCCCCT TTCGCCGGGC GCGATCGCGA ATTTCGCTCA GAAGACCAGG
AGCTACGCTG CCTATGGTCA GGTGACGTAC AAGCTGACCA ATAGGCTCGA TCTCACCTTG
GGCGCGCGTT ACACCAACGA CGAGAAGTCC GGCTCCTTCG TTCAGACCAA TCCCAACGCC
GCCGCGCGCG TTCTAAGGTC GATCGAAGCC ACGGCGCTCG CGTTCAAGGA CGACCAGATC
ACCTGGCGCG CCAACCTGTC CTGGACGCCA GCGGACGACA TCCTGGTGTT CGCCAACTAT
GCCACCGGGT TCAAATCGGG CGGATTCAAC TCCGGCGGCG GGACGCCCGC CCTGACCGCC
GCGACCCGTC CGTTCGGGTC CGAAACAGTG GACGACTACG AACTGGGCGT GAAGTCGACC
CTGTTCGACC GGGTGCTGCA ACTCAATGCA ACGCTGTTTC GCACTGACGT CCACGATTAC
CAGGACCGCA GCTTCAACGG CCTCAGCTTC CTGGTGCGCA ATGCCGCCGA CCTGCGCCAG
CAGGGCGTGG AGGCCGACTT CATCCTGCGG CCCTTCGACG GACTGCGGAT CAACGGCGCC
GTGGGCTACC TGGATTCCAA ATATCTGAGC TACCCGGGCG CCTCCGGCCT GCCGGGCTTC
GGCGGGGTCC AGGATCTGTC GGGCAAGCGC AACAACTTCT CCCCAGAATG GCAAGGCGCC
GTCGGCGTCC AGTACGACCG GGACATCGCC GGCGGCGTCG GCCTGACCGT GAGAGGCGAC
ATGACCTTCG TCTCCGACAA CAACGTCGGC TCGGTGACGG ACGCCAACCC GCAGATGATC
CAAGACGGTT TCGCTCTGTT CTCGGGCCGG GTCACGTTCA CCTCGCCGGA CCGCCGCTAC
GGACTGCAGA TCTTCGGCGA GAACCTCTTC GACAAGGGCT ACTACACCTA CATCTTCCCG
AACACGCTCG ACAACGTCCT GGGGCTGAGG GACTCGACCA CCGGCGGCAC ATTGATGCGT
GGCACCCTGG GCACGCCGCG TACCGTGGGC GTGAAGCTGA CCGTGTCGTT CTGA
 
Protein sequence
MTKALWLAAS AMLVCPYVAG TALAQEAPPV KAAEPESARL DEVVVTAQKR AENVQTVPIS 
IAAFSATQLT QAGVDDVLEI TKIVPSFTST RQANVSSVRL NIRGIGASAQ TAVEPSVASF
LDDVYVSRPG AVVGRFYDVE SVEVLRGPQG TLFGRNASAG ALSIHTKKPT DRFGGDLSVQ
AASFGSYEAS GAVNIPIGDR AAVRIAGVGA STDGPWHTDI GDHDYGALDT IGGRATLRLK
PTDQIDWILR ADYLHLTGDA GSHNTVKSDT VTPAARANYL ARLGITPYFD DQFSRTSNNF
MVGDLDDHQY GLTSDLTWDV ADGYSIRLID AMRDWSSDQV AGDLAFSPRP LLLRNEIQAS
YSSSHELQFI SPGDQLLGGR LSFVSGLYYF DETLTIDEKV TFTADTCNYI IRLAAPALQA
ACLASPLSPG AIANFAQKTR SYAAYGQVTY KLTNRLDLTL GARYTNDEKS GSFVQTNPNA
AARVLRSIEA TALAFKDDQI TWRANLSWTP ADDILVFANY ATGFKSGGFN SGGGTPALTA
ATRPFGSETV DDYELGVKST LFDRVLQLNA TLFRTDVHDY QDRSFNGLSF LVRNAADLRQ
QGVEADFILR PFDGLRINGA VGYLDSKYLS YPGASGLPGF GGVQDLSGKR NNFSPEWQGA
VGVQYDRDIA GGVGLTVRGD MTFVSDNNVG SVTDANPQMI QDGFALFSGR VTFTSPDRRY
GLQIFGENLF DKGYYTYIFP NTLDNVLGLR DSTTGGTLMR GTLGTPRTVG VKLTVSF