Gene TM1040_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2237 
Symbol 
ID4077304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2350145 
End bp2351275 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content60% 
IMG OID638007559 
Productextracellular ligand-binding receptor 
Protein accessionYP_614231 
Protein GI99082077 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.246635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGC TAATGTTGGC CACAGCGGCC GCGGCGCTGG CCGCGGGCGG AGCGATGGCA 
GAGGTCAAAG TCGGGATGAT CACCACGCTT TCGGGCGGCG GTGCGAGCCT CGGCATCGAC
ACGCGGGACG GGTTCATGCT GGCGATGGAA GCCGCAGGTC GCGATGATGT CGAAGTGGTC
ATCGAGGACG ACCAGCGCAA GCCCGACATC GCCGTGCAGC TTGCCGATAA GATGATCCAG
TCCGAAAAGG TCGACGTGAT GACCGGTATC GTCTGGTCCA ACCTTGCAAT GGCCGTGGTG
CCTGCGACTA CCGCGCAGGG GCTGTTCTAT CTTTCGACCA ACGCGGCCCC CGCACAGCTG
GCGGGCAAAG GCTGCAACGC CAATTATTTC TCGGTCGCCT ACCAGAACGA CAACCTGCAT
GAAGGCGCGG GCGCCTATGC AACGCAGGCG GGGTTCAAGA ACACCTTCAT TCTCGCACCG
AACTACCCGG CGGGGATCGA CAGCCTCACT GGCTTCAAAC GTTTCTATGA AGGCGATCTC
GCAGGGGAGG TCTACACCAA GCTCGGCCAG ACTGATTACG CGGCTGAAAT CGCGCAGATC
CGCGCATCCG GCGCCGACAG CGTGTTCTTC TTCCTGCCCG GCGGCATGGG GATTTCCTTC
CTGAAGCAAT ATTCTGACAG CGGCGTCGAC CTGCCCGTCG TCGGCCCGGC CTTCAGCTTT
GATCAGGGCA TCCTGCAAGC GGTGGGCGAA GCGGCGCTTG GCGTCAAGAA CTCCTCCACC
TGGTCCAAGG ATCTGGACAA TGAGGCCAAC GCGGCCTTTG TTGCGGCCTT CCAGCAGAAA
TACGACCGTC TGCCGTCGAT CTATGCGGCG CAGGGCTATG ACACCGCAAA CCTGCTGCTG
TCGGCCATCG ACAAGGCGGA TGTGAATGAT GACGCAGCCT TTGCTGCGGC CCTCAAGGAG
GCCGATTTTG CCTCTGTGCG CGGCGAATTC TCCTTTGCGG CCAACAACCA CCCGATCCAG
AACGTCTATG TGCGTGAGGT CATCAAGGAA GGCGACGTCT ACACCAACAA GATCGTCGGC
ACCGCTCTTG AGGATCATGC AAACGCCTAT GTGGACGAGT GCAAGATGTA A
 
Protein sequence
MKKLMLATAA AALAAGGAMA EVKVGMITTL SGGGASLGID TRDGFMLAME AAGRDDVEVV 
IEDDQRKPDI AVQLADKMIQ SEKVDVMTGI VWSNLAMAVV PATTAQGLFY LSTNAAPAQL
AGKGCNANYF SVAYQNDNLH EGAGAYATQA GFKNTFILAP NYPAGIDSLT GFKRFYEGDL
AGEVYTKLGQ TDYAAEIAQI RASGADSVFF FLPGGMGISF LKQYSDSGVD LPVVGPAFSF
DQGILQAVGE AALGVKNSST WSKDLDNEAN AAFVAAFQQK YDRLPSIYAA QGYDTANLLL
SAIDKADVND DAAFAAALKE ADFASVRGEF SFAANNHPIQ NVYVREVIKE GDVYTNKIVG
TALEDHANAY VDECKM