Gene TM1040_0496 details


Gene Information

Locus tag: TM1040_0496
Symbol:
ID: 4078242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 518906
End bp: 520426
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 62%
IMG OID: 638005792
Product: ABC transporter related
Protein accession: YP_612491
Protein GI: 99080337
COG category: [R] General function prediction only
COG ID: [COG3845] ABC-type uncharacterized transport systems, ATPase components
TIGRFAM ID:
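
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below is illustrative only; it assumes (these assumptions are not stated in the record) that coordinates are 1-based and inclusive, that the CDS is unspliced, and that the gene length includes the stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
START_BP = 518906
END_BP = 520426

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```

Both results agree with the Gene Length (1521 bp) and Protein Length (506 aa) fields.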


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0969653
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACGC CTCCCCTTCT GACCCTTGAC GGCCTCACCA AGGCCTATCC CGGCGTCGTC 
GCCAATGACC AGGTCTCCTT CACCATCGGC ACAGGCGAGG TGCATGCGCT TCTGGGTGAA
AACGGTGCCG GAAAATCGAC ACTCGTGAAG ATGATCTATG GCCTCGTGAA ACCGGATTCC
GGTCGCATGA CCCTGCGCGG AGCGGCCTAT GAGCCCTCCG AGCCGCGTCA AGCGCGCGCA
GACGGCATTG CCATGGTATT TCAGCATTTT TCGCTATTTG ATGCGCTGAA TGTGGCAGAA
AACATTGCCC TTGGCATGGA GAGCCCACCG GCGCTGCGTG ATCTGGCGGC CCGGATCCGT
GAGGTGTCCG AAGCCTATGG GCTGCCGCTC GATCCTTACC GCACGGTCGG AGACCTTTCG
GCGGGCGAGC GTCAGCGGGT GGAGATCATT CGCTGCCTGC TGCAGGACCC AAAGCTTCTG
ATCATGGACG AGCCGACATC AGTTCTGACA CCGCAAGAGG TGGAGATCCT CTTTGAGACC
CTGCGCAAGC TGAAGTCCGA AGGGACCTCG ATCCTTTATA TCTCGCACAA GCTCGAAGAA
ATCCGCGCGC TCTGCGACCA CGCGACGATC CTGCGGCTCG GCAAGAATGT GGGCGAATGC
GTGCCGCGCG AGACCTCTGC CCGCGAGATG GCAGAACTGA TGGTGGGTAG CGCGCTCAAG
ACGCCGGAAC GCGGCGCGCG CAGATTTGGT GATGTGGCAC TGGATATTTC TGGCCTTTCG
GTGCCCGCGC CTTCGGCATT TGGCACCGCG TTGAAGAATG TGCACCTGAC CGTGCGTCGG
GGCGAGATCC TTGGCATTGG TGGTGTTGCA GGCAACGGGC AGGACGAGCT GCTATCAGTG
ATGTCGGGTG AGGTGACCAC AGCCCGCGAC GCGGTGAAAT TCGACGGCCA GCCCATCGGG
CGAATGGGCC CAACCGCGCG CCGCGCTCTT GGTGTCCTCA GTGCTCCAGA AGAGCGCCTG
GGCCATGCCG CCGCGCCGGA TATGAGCCTG ACCGAAAATG CGCTTTTGAC CGGGTCGGTG
CGCGAGGGGC TGGAGCAGAA CGGTTTTTTG AAATGGGGGG CGACCAAGTC CTTTGCAGAG
AAGATCATCA AGGCGTTTGA TGTGCGCACA CCGGGACCGG AGAACGCGGC GCGCTCGCTC
TCTGGTGGGA ATTTGCAGAA GTTCGTCATT GGCCGTGAGG TGCTGCAGCG CCCCGAGGTC
CTGATCGTGA ACCAGCCCAC ATGGGGCGTG GATGCGGCGG CGGCGGCGGC AATACGGCAG
TCGCTACTTG ATCTTGCCGC GCAGGGGACT GCGGTGATCT GCATCAGTCA GGATCTCGAT
GAGCTGATGG AAATCTCCGA CAATTTCGCC GCCCTCAATG AGGGTCGTCT GTCCGCCCCG
CGCCCCACGG GGGAGCTGAC GGTGGATGAG ATCGGCCTGA TGATGGGCGG TGCACATGGC
ATGGAAGTGG CGCATGTGTA G
 
Protein sequence
MSTPPLLTLD GLTKAYPGVV ANDQVSFTIG TGEVHALLGE NGAGKSTLVK MIYGLVKPDS 
GRMTLRGAAY EPSEPRQARA DGIAMVFQHF SLFDALNVAE NIALGMESPP ALRDLAARIR
EVSEAYGLPL DPYRTVGDLS AGERQRVEII RCLLQDPKLL IMDEPTSVLT PQEVEILFET
LRKLKSEGTS ILYISHKLEE IRALCDHATI LRLGKNVGEC VPRETSAREM AELMVGSALK
TPERGARRFG DVALDISGLS VPAPSAFGTA LKNVHLTVRR GEILGIGGVA GNGQDELLSV
MSGEVTTARD AVKFDGQPIG RMGPTARRAL GVLSAPEERL GHAAAPDMSL TENALLTGSV
REGLEQNGFL KWGATKSFAE KIIKAFDVRT PGPENAARSL SGGNLQKFVI GREVLQRPEV
LIVNQPTWGV DAAAAAAIRQ SLLDLAAQGT AVICISQDLD ELMEISDNFA ALNEGRLSAP
RPTGELTVDE IGLMMGGAHG MEVAHV
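
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted initiation codon and is read as methionine at the first position. The following is a minimal sketch of that rule, not the database's own pipeline: the helper names are hypothetical, only the first and last lines of the gene sequence are used as input, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for table 11 in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        # An alternative start codon at position 0 is read as methionine.
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 21 bases of the gene sequence shown above.
first_bases = "GTGAGCACGC CTCCCCTTCT GACCCTTGAC"
last_bases = "ATGGAAGTGG CGCATGTGTA G"

print(translate(first_bases))  # MSTPPLLTLD
print(translate(last_bases))   # MEVAHV
```

The two outputs match the first ten and last six residues of the protein sequence. The final fragment is in frame because it is preceded by 1500 bases, a multiple of three. Applying gc_fraction to the complete gene sequence would be one way to reproduce the GC content field.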