Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2021 |
Symbol | |
ID | 3831396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2108109 |
End bp | 2109623 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637829950 |
Product | xylose transporter ATP-binding subunit |
Protein accession | YP_430860 |
Protein GI | 83590851 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCACT ATATTCTAGA AATGCAGGAA ATTACAAAGC AATTTCCCGG CGTCAAGGCT CTAGACAAAG TTGATTTTAA GGCCAGGAAA GGTGAAATAC ATGCTTTATG TGGCGAAAAT GGGGCTGGTA AATCTACTTT AATGAAAGTC CTTAGCGGTG TATATCCTTA TGGCACCTAC CAGGGAGAAA TTTTGATTAA CGGTCAACCC CAAAAGTTCT ACACCATTAA GGATTCAGAA AGAGCCGGGA TAGCCATTAT TTACCAGGAG CTGGCTTTAG TTAGCGAATT ATCTGTTGCG GAAAACATTT TTCTGGGCAA TGAGCCTCTG CATCACCACT TAATTGATTG GGATAAAATG TATATAGAGG CAACGAAATG GCTCAAAGAG GTTGGCCTGG ATGTCAGCCC GGGAACTAAA ATTAAAAACC TGGGTGTAGG GCAACAGCAA CTAGTTGAGA TAGCCAAAGC TTTAGCGAAG AACGCCAGCA TTCTTGTTCT AGATGAACCT ACTGCCGCTT TGACAGAAGC TGAAGTAGAA ATTCTGATGC ATATTTTACA CCAGCTAAAA AGTAAGGGAG TAACATGTAT TTATATATCC CATAAGTTAA ATGAAGTCTT TGCAATTGCC GATAATATTA CTGTCCTTCG TGATGGTAGA ACCATTGGTA CGGTGAAAAA AGACGAAACG AGTCAAGATA AGATCATAAC AATGATGGTT GGCCGGGAAC TGAACAGGCT TTTTCCCTAT ATTAACCATA GTCCCGGAGC AATTACTTTG GAGGTTCGTA ACTTTAGCGT CTACAACCCA GATAACCCCC GCAAAAAAAT AGTAAAAGAT GTTAACTTTT ATGTCCGCAA AGGGGAAGTC CTGGGTATTG CCGGACTTAT AGGATCGGGT CGTACGGAAC TAGTTACTAG CATTTATGGT GGTTATCCAG GAAAAAATGA AGGAGAAATA TGGCTGGATG GGAGAAAGAT AAAAATAAAG AATTCTGAAG ATGCCCTTTC GAATGGGATT GCTCTCGTGC CGGAAGATCG CCGGCGTCAA GGGCTGGTAC TGGATATGGA TATCTGCAAA AATATAACCC TTGCCAGCTT AAAAAGATCA TATAATATTA TGCTTAACGA AAGCGCGGAA ATTAGAGATG CGGAATTCTT TGTTGATAAA TTGAAAATAA AGAGCCCCTC TGTAGAAGCT CGTGTGGGAA ACTTGAGTGG TGGGAATCAA CAAAAGGTAG TCCTGGGTAA AGCCCTGATG ACTAACCCCA GGGTTCTAAT TCTTGATGAA CCAACGCGGG GTATTGATGT GGGGGCTAAA TATGAAATAT ATAATCTAAT TAATAGTTTA GTTAGTCAGG GTGTAGCCAT AGTTATGGTG TCATCCGAGT TACCTGAGAT ATTGGGTATG AGCGATCGTA TTTTGGTGTT ATGTGAAGGT AGAATAACTG GCGAGTTTTC CCGTGAAGAA GCTACAGAAG AAAAAATAAT GGCCTGTGCA ACTGGAGGTA AGTAA
|
Protein sequence | MDHYILEMQE ITKQFPGVKA LDKVDFKARK GEIHALCGEN GAGKSTLMKV LSGVYPYGTY QGEILINGQP QKFYTIKDSE RAGIAIIYQE LALVSELSVA ENIFLGNEPL HHHLIDWDKM YIEATKWLKE VGLDVSPGTK IKNLGVGQQQ LVEIAKALAK NASILVLDEP TAALTEAEVE ILMHILHQLK SKGVTCIYIS HKLNEVFAIA DNITVLRDGR TIGTVKKDET SQDKIITMMV GRELNRLFPY INHSPGAITL EVRNFSVYNP DNPRKKIVKD VNFYVRKGEV LGIAGLIGSG RTELVTSIYG GYPGKNEGEI WLDGRKIKIK NSEDALSNGI ALVPEDRRRQ GLVLDMDICK NITLASLKRS YNIMLNESAE IRDAEFFVDK LKIKSPSVEA RVGNLSGGNQ QKVVLGKALM TNPRVLILDE PTRGIDVGAK YEIYNLINSL VSQGVAIVMV SSELPEILGM SDRILVLCEG RITGEFSREE ATEEKIMACA TGGK
|
| |