Gene TM1040_2817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2817 
SymbolthiP 
ID4076636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2980047 
End bp2981618 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content63% 
IMG OID638008143 
Productthiamine transporter membrane protein 
Protein accessionYP_614811 
Protein GI99082657 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1178] ABC-type Fe3+ transport system, permease component 
TIGRFAM ID[TIGR01253] thiamine ABC transporter, permease protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.229213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.678489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGCA GCGCTCAGCC GATAAGCTTG CCCGTCGTCG CGACCTGGCC CGGAACCTGC 
GCTGCTCTTG GTGTGTTGCT CTTGGTTCTG GGCACACTTG TCGCGGTCGC CTGGCATGCT
GACGGGGGCG CTAGGTTTGC CGCAGCCGAT TGGGCCGCGC TTCGCTTCAC GCTCGTCCAA
GCAAGCCTGT CTGCGCTGCT GAGTGTCGGG CTTGCCGTGC CTGTGGCGCG TGCCTTGGCG
CGGCGGCGCT TTTGGGGGCG GCGTGCACTC ATTACCTTGC TCGGCGCGCC ATTCATCCTG
CCCGTGATTG TTGCCATTCT CGGATTGCTC TCTGTCTTTG GGCGGTCTGG TTGGGTGAGT
GACATCCTCA GCCTCATCGG CGTGCCGCCC TTGCAGATCT ACGGGCTGCA CGGGGTAGTG
ATTGCCCATG TGTTCTTTAA CTTGCCACTG GCGACCCGTC TGATCCTTCA GGGCTGGCAA
GAAATCCCGG CCGAACGCTT TCGTTTGGCG GCACAGTTGA ACGCGGGGCC GCGCGCAATA
TGGTTCTTGA TCGAATTGCC GATGCTGCGA CAGGTCCTGC CGGGTGCTGC GGCGGTGATT
TTCGCCATCT GCCTTTCCAG CTTTGCGGTT GCCTTGACCC TGGGGGGCGG GCCTCGCGCC
ACCACGCTGG AACTCGCGAT CTACCAGGCC TTTCATTTTG ACTTTGATCT TGGGCGGGCG
GCGATGCTTG CGACTGTGCA GCTTGGTCTG ACCATTGTGG CAGCACTTCT AGCGCTTCGG
GCCTCCACCA CCGAGGGGCT GGGGGCTGGA CTTGATCGTG CGGTGATGCG TTGGGACGGC
AAAGGCGCAT TCACGCGCTG TCTGGATGGT GTCTGGATTT TCGCTGCTGC TCTGTTCCTG
CTGTTGCCAT TGCTGTCGGT GGTGATGGCA GGGCTCCCGG GGCTTTGGAC GCTGCCGCGT
TCGGTCTGGA GCGCGGCATT GGTCTCTTTG CAGGTTGCAA CCCTGAGTAC GGTGCTGTTG
TTGATTGTTG CGCTGCCGCT CTCGGCGAGC ATCGCGCTTG GACGCAAGCG CTTGGGCGAG
GTCGCGGGGA TCCTTGGGCT CGCCGCGTCG CCATTGGTGA TTGGGACTGG ATTGTTTGTC
ATGGTCTATC CCTTTGTGGA TCCGCGCCTC TTGGCTCTAC CGGTGACGGC CCTCGTGAAC
ATGGTAATGG CCTTGCCCTT TGCGCTGCGC ATCCTCGTGC CACGCCTGCG GCAGGTGCTT
TGGCAGAATG GCCGGCTTGC CTTGGCGCTG GATATGCGCG CGGGTACGGT CTGGAGGATT
GTTCTGCTGC CGCGCTGCCG CGCGCAGATC GGATTCGCGA CCGGGCTTAC TGCTGCGCTT
TCGCTTGGCG ATCTTGGTGT AATCGCACTT TTTGCTGATC CAGAAGTCGC CACCCTGCCG
CTCCAGGTCT ACCGGCTGAT GGGGGCCTAC CGGATGGAGG CGGCGCAGGC AGCGGCGCTC
CTGTTGTTTG TGTTGTCGAT AGGTGCATTC TGGGCGCTGG ACCGGGGAGG GCGTTTTCAT
GCTGAGGCTT GA
 
Protein sequence
MARSAQPISL PVVATWPGTC AALGVLLLVL GTLVAVAWHA DGGARFAAAD WAALRFTLVQ 
ASLSALLSVG LAVPVARALA RRRFWGRRAL ITLLGAPFIL PVIVAILGLL SVFGRSGWVS
DILSLIGVPP LQIYGLHGVV IAHVFFNLPL ATRLILQGWQ EIPAERFRLA AQLNAGPRAI
WFLIELPMLR QVLPGAAAVI FAICLSSFAV ALTLGGGPRA TTLELAIYQA FHFDFDLGRA
AMLATVQLGL TIVAALLALR ASTTEGLGAG LDRAVMRWDG KGAFTRCLDG VWIFAAALFL
LLPLLSVVMA GLPGLWTLPR SVWSAALVSL QVATLSTVLL LIVALPLSAS IALGRKRLGE
VAGILGLAAS PLVIGTGLFV MVYPFVDPRL LALPVTALVN MVMALPFALR ILVPRLRQVL
WQNGRLALAL DMRAGTVWRI VLLPRCRAQI GFATGLTAAL SLGDLGVIAL FADPEVATLP
LQVYRLMGAY RMEAAQAAAL LLFVLSIGAF WALDRGGRFH AEA