Gene TM1040_1272 details

Gene Information

Locus tag: TM1040_1272
Symbol:
ID: 4077432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1370689
End bp: 1371888
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 60%
IMG OID: 638006580
Product: extracellular ligand-binding receptor
Protein accession: YP_613267
Protein GI: 99081113
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.570271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC TGCTGACCGC GACGGCAGCC ATCGCATTGA GCGCTGGCAC CGCGTTTGCA 
GACGGCCACG GCCACCCCGA CGAAGTAAAG CTGGGTGTCC TGTTTGGCTT CACCGGCCCG
ATCGAATCCC TGGCGCCGAC CATGGCCTCT GGTGCTGAAC TCGCGATGTC CGAAGTCACC
GAGTCCGGCA AACTCTTTGG CGGTGCAAAA GTGACCCCGA TGCGCGCGGA CACCGGCTGT
ATCGACAATG GTCTTGCGAC TGCGAACGCA GAAAAGCTGA TCGCAGATGG CGCCAACGGC
ATCGTGGGTG GTGACTGTTC CGGCGTGACC GGCGCGATCC TGCAGAACGT CGCGATCCCG
AACGGCATGG TGATGATTTC TCCCTCCGCA AGCTCGCCGG GTCTGACCTC GATGGAAGAC
AACGGCCTGT TCTTCCGGAC CACCCCGTCT GACGCACGTC AGGGCGAGAT CATGGCGTCG
ATCCTTGCAG ATCGTGGCGT CGACAGCATC GCCATCACCT ATACCAACAA CGATTACGGC
AAGGGTCTGT CGGATTCGAT CAAATCCGCA TTCGAGGCCG CAGGCGGTGA AGTCACCATC
GTGACCGCGC ATGAAGACGG CAAGGGTGAC TACTCTGCCG AGGTTGCGGC GCTGGCATCA
GCCGGTGGCG ATATTCTGGT TGTTGCGGGC TATCTCGACC AGGGTGGTCT GGGCATCATC
CAGGGCGCGC TCGACACCGG TGCGTTCGAC ACCTTTGGTC TGCCGGACGG GATGATCGGC
GATTCGCTGC CCAACAACGT GGGCCCGGAC CTCAATGGCT CCTTCGGGCA GATCGCCGGC
TCTGACAGTG AAGGTGCCGA GATGTTCGCT GCCAAAGCCT CCGAGCTTGG CTTTGACGGT
ACTTCTGCCT ATTCGCCGGA ATCCTATGAT GCGGCAGCGC TTTTCCTGCT CGCGATGCAG
GCATCGGGCT CTGTTGATCC CAAGGATTAC GTCGCCAAGA TCACCGAAGT CGCCAATGCT
CCGGGTGAGA AAATCAACCC CGGTGAGCTC GGCAAAGCGC TCGAAATTCT CGCCAATGGC
GGTGAGATCG ACTATGAGGG CGCAACCGGC GTCAACCTGA TCGGCCCCGG CGAGAGCGCA
GGCTCTTTCC GTGAGATCGA AGTTCAGGAC GGCAAGAACG TGACCGTGAA ATTCCGCTAA
 
Protein sequence
MKKLLTATAA IALSAGTAFA DGHGHPDEVK LGVLFGFTGP IESLAPTMAS GAELAMSEVT 
ESGKLFGGAK VTPMRADTGC IDNGLATANA EKLIADGANG IVGGDCSGVT GAILQNVAIP
NGMVMISPSA SSPGLTSMED NGLFFRTTPS DARQGEIMAS ILADRGVDSI AITYTNNDYG
KGLSDSIKSA FEAAGGEVTI VTAHEDGKGD YSAEVAALAS AGGDILVVAG YLDQGGLGII
QGALDTGAFD TFGLPDGMIG DSLPNNVGPD LNGSFGQIAG SDSEGAEMFA AKASELGFDG
TSAYSPESYD AAALFLLAMQ ASGSVDPKDY VAKITEVANA PGEKINPGEL GKALEILANG
GEIDYEGATG VNLIGPGESA GSFREIEVQD GKNVTVKFR
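
The length and composition fields in this record can be cross-checked against the coordinates and the sequence text. The sketch below is illustrative only: the function names are hypothetical and not part of any database API, coordinates are assumed to be 1-based and inclusive, and the coding sequence is assumed to end in a single stop codon that is not counted in the protein length.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used to display sequences in 10-character blocks."""
    return "".join(text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    length = span_length(1370689, 1371888)
    print(length, protein_length(length))
```

With the Start bp and End bp values listed in this record, the span comes to 1200 bp and the derived protein length to 399 aa, matching the Gene Length and Protein Length fields. Passing the full gene sequence text to `gc_content` gives a percentage that can be compared with the rounded GC content field.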