Gene TM1040_0542 details


Gene Information

Locus tag: TM1040_0542
Symbol:
ID: 4077189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 574699
End bp: 575739
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 64%
IMG OID: 638005839
Product: ABC transporter related
Protein accession: YP_612537
Protein GI: 99080383
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
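
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the stop codon is counted in the gene length but not in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 574699
END_BP = 575739


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1041
print(protein_length(length_bp))  # 346
```

Both results match the Gene Length and Protein Length fields of the record.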


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.999499
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAGC TGACCCTGTC CGACATCACC AAACACTTTG GCGACGCGCC TGCTTTGCGC 
TCCATCTCGA CCACTGTGCG CGATGGAGAG TTCCTCGCCC TTCTGGGTCC TTCGGGCTGC
GGGAAGTCCA CGCTTTTGCG ACTGCTGGCC GGATTCGAGA CCCCCAGCGA GGGGCGCATT
TCCATTGGAA CGCGCGAGGT CGCCAATGCA GAAAGACGCC TGAACCTGCC GCCCGAAGAA
CGCAACATCG GCTTTGTGTT CCAGTCCTAT GCGCTCTGGC CGCATATGAA CGTGCGGCGC
AATGTGGGCT ATCCGCTGGA AATCCGCCGC CTGCCCCGGG CAGAGCAAGA CGCCCGGATT
GATGCGGCGC TGCAGGCGAC CGCCCTTGAA CCCTATGGCG CGCGGATGCC CGCGGAGCTT
TCGGGTGGGC AACGGCAGCG GGTGGCGCTT GCGCGCTGTC TGGTCTCTGA TCCTACGGCG
GTTTTGCTGG ATGAGCCGCT CGCAAATCTG GACGTGGCCT TGCGCGCGTC GATGCAACAG
GTCTTTAGCG ACTTTCACCA GCGCACTCAG GCCACCATGG TCTATGTCAC TCACGACCAG
GCCGAGGCGA TGGCCATGGC GGATCGCATC GCGGTCATGA ACCAGGGTGA GATCGTACAA
CTCGACACGC CCGAGGCCCT TTATGCCCGC CCCCGCAGCC GCTTTGTCGC CGAATTCGTG
GGCGAAGGCG CTGTGGTGCC CCTGATCGGC GCGACCCATC ACGACAGAGG CGCAGAGGCG
ACCTTGCTCG GGGCGCGCTA CCCCATTGAA ACCGACACTC AAACCCCGGC GCTTGCCTGC
CTCAGACCTG AAAACCTGCA GATTTCGGAC GCCGGCAACA TTCGCGCGCG GGTGGAGCGG
GTGACGTATC TCGGGGGACG CTACCGGCTC GAACTGACCG CCGCGAGTGG CGACAGCCTG
GTAACCCAAT CCGCAACCCG CTTTGCCCTA GGCGAACAGA TCGGCCTGAC CCTATCCGCC
CCATGGGCCT TTGCCGCCTA G
 
Protein sequence
MAELTLSDIT KHFGDAPALR SISTTVRDGE FLALLGPSGC GKSTLLRLLA GFETPSEGRI 
SIGTREVANA ERRLNLPPEE RNIGFVFQSY ALWPHMNVRR NVGYPLEIRR LPRAEQDARI
DAALQATALE PYGARMPAEL SGGQRQRVAL ARCLVSDPTA VLLDEPLANL DVALRASMQQ
VFSDFHQRTQ ATMVYVTHDQ AEAMAMADRI AVMNQGEIVQ LDTPEALYAR PRSRFVAEFV
GEGAVVPLIG ATHHDRGAEA TLLGARYPIE TDTQTPALAC LRPENLQISD AGNIRARVER
VTYLGGRYRL ELTAASGDSL VTQSATRFAL GEQIGLTLSA PWAFAA
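
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the space-separated 10-base blocks are purely cosmetic, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (table 11 differs in the start codons it allows, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (amino acid assignments
# shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCAGAGC TGACCCTGTC CGACATCACC"))  # MAELTLSDIT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.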