Gene TM1040_2551 details

Gene Information

Locus tag: TM1040_2551
Symbol:
ID: 4076682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2693730
End bp: 2694818
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 56%
IMG OID: 638007875
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_614545
Protein GI: 99082391
COG category: [R] General function prediction only
COG ID: [COG4174] ABC-type uncharacterized transport system, permease component
TIGRFAM ID:
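
The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and that the gene length includes a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2693730, 2694818)
print(length)                  # 1089
print(protein_length(length))  # 362
```

Both values match the Gene Length and Protein Length fields of this record.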


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.280481
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0903664
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAACT ATATTCTGCG GCGTCTTCTG CTGGTGATCC CGACACTGAT CGGGATCATG 
GTAGTGAATT TCGCGCTCGT GCAGTTTGTC CCCGGTGGAC CGGTGGAGCA GATGATTGCC
CGCCTTGAGG GCGGTGGCGA TGTCTTTGGC GGCTTTGCTG GGGCGGCGAA TGATGCCGGG
GCAGAGACCG TTGGTCAGGC CGAAAGCCAA TACGCCGGGG CGCGGGGGCT GCCGCCTGAA
TTCATCGAGG AACTGGAGCG CGAGTTCGGC TTTGACAAAC CTCCCCTCGA GCGGTTCCTC
AACATGATGT GGAACTACGT CCGGCTTGAT TTCGGCGAGA GCTATTTTCG CAATATCGGC
GTGATTGATC TGGTGCTCGA AAAGATGCCA GTGTCGATCT CGCTGGGACT GTGGTCCACC
CTCATTGCCT ATCTGATCTC GATCCCGCTT GGTGTGAAAA AGGCGCTGCG CGATGGCAGC
AGCTTTGACA GTTGGACCAG TGGCGTGATC ATCGCGGCCT ATGCAATCCC GGGCTTCTTG
TTTGCGATCC TCCTTCTTGT GCTTCTGGCC GGGGGGTCAT ACTGGCAGAT TTTCCCTCTA
AGGGGCCTGA CATCCGAGAA TTGGGAAGAG CTGAGCCTTC TGGGCAAGAT CGGCGACTAT
TTCTGGCACA TTACCTTGCC GGTGGTGGCC TCGACCATTT CGGCCTTTGC GACGCTCACC
TTGCTGACCA AGAACTCCTT TTTGGACGAG ATCAAGAAAC AGTATGTGAT GACTGCCCGT
GCCAAAGGGT TGACGGAAAA CCGGGTGCTC TACGGGCATG TGTTTCGCAA TGCGATGCTG
ATCGTGATTG CTGGTTTTCC GGGGGTGTTC ATCTCGATCT TTTTCACCGG CAGCCTGATC
ATCGAGACGA TCTTCTCGCT CGACGGTCTC GGGCGGCTCG GGTTTGAGGC TGCGGTCGCG
CGTGACTACC CGATTGTATT TGGGACACTT TTCATCTTTG GTCTGATGGG CCTGGTGGTT
GGGATCTTGT CAGACATTAT GTATGTGCTG GTGGATCCAC GCATCGACTT TGAAAAGCGG
GAGGGATGA
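
The gene sequence above is displayed in blocks of ten bases, so the spacing has to be removed before any computation. As a minimal sketch, the helpers below strip that grouping and compute a GC percentage, which could be compared with the GC content field of this record when run on the full sequence. The function names are hypothetical, and only the first displayed line is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, exactly as displayed in the record.
first_line = "ATGTTAAACT ATATTCTGCG GCGTCTTCTG CTGGTGATCC CGACACTGAT CGGGATCATG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```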
 
Protein sequence
MLNYILRRLL LVIPTLIGIM VVNFALVQFV PGGPVEQMIA RLEGGGDVFG GFAGAANDAG 
AETVGQAESQ YAGARGLPPE FIEELEREFG FDKPPLERFL NMMWNYVRLD FGESYFRNIG
VIDLVLEKMP VSISLGLWST LIAYLISIPL GVKKALRDGS SFDSWTSGVI IAAYAIPGFL
FAILLLVLLA GGSYWQIFPL RGLTSENWEE LSLLGKIGDY FWHITLPVVA STISAFATLT
LLTKNSFLDE IKKQYVMTAR AKGLTENRVL YGHVFRNAML IVIAGFPGVF ISIFFTGSLI
IETIFSLDGL GRLGFEAAVA RDYPIVFGTL FIFGLMGLVV GILSDIMYVL VDPRIDFEKR
EG
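
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the amino-acid assignments that NCBI translation tables 1 and 11 share; it is an illustration only and does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# ordered by first, second, then third base over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGTTAAACT ATATTCTGCG GCGTCTTCTG"))  # MLNYILRRLL
print(translate("GAGGGATGA"))                         # EG
```

The two outputs agree with the first ten and last two residues of the protein sequence shown above.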