Gene Information |
Locus tag | TM1040_2817 |
Symbol | thiP |
ID | 4076636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2980047 |
End bp | 2981618 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638008143 |
Product | thiamine transporter membrane protein |
Protein accession | YP_614811 |
Protein GI | 99082657 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1178] ABC-type Fe3+ transport system, permease component |
TIGRFAM ID | [TIGR01253] thiamine ABC transporter, permease protein |
|
|
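The start, end, gene length, and protein length fields above agree with one another, which can be checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and a complete CDS whose single terminal stop codon is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for TM1040_2817 (thiP).
length_bp = gene_length(2980047, 2981618)
print(length_bp)                  # 1572
print(protein_length(length_bp))  # 523
```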
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.229213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.678489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCA GCGCTCAGCC GATAAGCTTG CCCGTCGTCG CGACCTGGCC CGGAACCTGC GCTGCTCTTG GTGTGTTGCT CTTGGTTCTG GGCACACTTG TCGCGGTCGC CTGGCATGCT GACGGGGGCG CTAGGTTTGC CGCAGCCGAT TGGGCCGCGC TTCGCTTCAC GCTCGTCCAA GCAAGCCTGT CTGCGCTGCT GAGTGTCGGG CTTGCCGTGC CTGTGGCGCG TGCCTTGGCG CGGCGGCGCT TTTGGGGGCG GCGTGCACTC ATTACCTTGC TCGGCGCGCC ATTCATCCTG CCCGTGATTG TTGCCATTCT CGGATTGCTC TCTGTCTTTG GGCGGTCTGG TTGGGTGAGT GACATCCTCA GCCTCATCGG CGTGCCGCCC TTGCAGATCT ACGGGCTGCA CGGGGTAGTG ATTGCCCATG TGTTCTTTAA CTTGCCACTG GCGACCCGTC TGATCCTTCA GGGCTGGCAA GAAATCCCGG CCGAACGCTT TCGTTTGGCG GCACAGTTGA ACGCGGGGCC GCGCGCAATA TGGTTCTTGA TCGAATTGCC GATGCTGCGA CAGGTCCTGC CGGGTGCTGC GGCGGTGATT TTCGCCATCT GCCTTTCCAG CTTTGCGGTT GCCTTGACCC TGGGGGGCGG GCCTCGCGCC ACCACGCTGG AACTCGCGAT CTACCAGGCC TTTCATTTTG ACTTTGATCT TGGGCGGGCG GCGATGCTTG CGACTGTGCA GCTTGGTCTG ACCATTGTGG CAGCACTTCT AGCGCTTCGG GCCTCCACCA CCGAGGGGCT GGGGGCTGGA CTTGATCGTG CGGTGATGCG TTGGGACGGC AAAGGCGCAT TCACGCGCTG TCTGGATGGT GTCTGGATTT TCGCTGCTGC TCTGTTCCTG CTGTTGCCAT TGCTGTCGGT GGTGATGGCA GGGCTCCCGG GGCTTTGGAC GCTGCCGCGT TCGGTCTGGA GCGCGGCATT GGTCTCTTTG CAGGTTGCAA CCCTGAGTAC GGTGCTGTTG TTGATTGTTG CGCTGCCGCT CTCGGCGAGC ATCGCGCTTG GACGCAAGCG CTTGGGCGAG GTCGCGGGGA TCCTTGGGCT CGCCGCGTCG CCATTGGTGA TTGGGACTGG ATTGTTTGTC ATGGTCTATC CCTTTGTGGA TCCGCGCCTC TTGGCTCTAC CGGTGACGGC CCTCGTGAAC ATGGTAATGG CCTTGCCCTT TGCGCTGCGC ATCCTCGTGC CACGCCTGCG GCAGGTGCTT TGGCAGAATG GCCGGCTTGC CTTGGCGCTG GATATGCGCG CGGGTACGGT CTGGAGGATT GTTCTGCTGC CGCGCTGCCG CGCGCAGATC GGATTCGCGA CCGGGCTTAC TGCTGCGCTT TCGCTTGGCG ATCTTGGTGT AATCGCACTT TTTGCTGATC CAGAAGTCGC CACCCTGCCG CTCCAGGTCT ACCGGCTGAT GGGGGCCTAC CGGATGGAGG CGGCGCAGGC AGCGGCGCTC CTGTTGTTTG TGTTGTCGAT AGGTGCATTC TGGGCGCTGG ACCGGGGAGG GCGTTTTCAT GCTGAGGCTT GA
|
Protein sequence | MARSAQPISL PVVATWPGTC AALGVLLLVL GTLVAVAWHA DGGARFAAAD WAALRFTLVQ ASLSALLSVG LAVPVARALA RRRFWGRRAL ITLLGAPFIL PVIVAILGLL SVFGRSGWVS DILSLIGVPP LQIYGLHGVV IAHVFFNLPL ATRLILQGWQ EIPAERFRLA AQLNAGPRAI WFLIELPMLR QVLPGAAAVI FAICLSSFAV ALTLGGGPRA TTLELAIYQA FHFDFDLGRA AMLATVQLGL TIVAALLALR ASTTEGLGAG LDRAVMRWDG KGAFTRCLDG VWIFAAALFL LLPLLSVVMA GLPGLWTLPR SVWSAALVSL QVATLSTVLL LIVALPLSAS IALGRKRLGE VAGILGLAAS PLVIGTGLFV MVYPFVDPRL LALPVTALVN MVMALPFALR ILVPRLRQVL WQNGRLALAL DMRAGTVWRI VLLPRCRAQI GFATGLTAAL SLGDLGVIAL FADPEVATLP LQVYRLMGAY RMEAAQAAAL LLFVLSIGAF WALDRGGRFH AEA
|
| |
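The gene sequence above is listed in coding orientation: it begins with ATG and ends with TGA even though the gene lies on the minus strand, so it can be translated directly without reverse-complementing. The following is a minimal sketch, assuming the sequence is pasted as a string with its 10-base blocks separated by whitespace. NCBI translation table 11 assigns the same amino acids to codons as the standard code (it differs in the start codons it permits), so the standard codon layout is used. The helper names `clean`, `gc_content`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI's TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
prefix = "ATGGCGCGCA GCGCTCAGCC GATAAGCTTG"
print(translate(prefix))  # MARSAQPISL
print(gc_content(prefix))
```

Applied to the full gene sequence, `translate` should reproduce the listed protein sequence and `gc_content` should land near the reported 63%. Neither has been run on the full sequence here.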