Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0156 |
Symbol | |
ID | 3831868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 156365 |
End bp | 157753 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828095 |
Product | amino acid permease-associated region |
Protein accession | YP_429037 |
Protein GI | 83589028 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.041937 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTATAG CGGCAGCCAG GACCCGGAGC CAGGCCGGGC AAAAAGTGGG CCTGCGGCGG GATTTAGGGA TCTGGGAGAG CTATGCTACC TTAATTGGCG TCCTCATAGG TTCGGGCATT TTCGTGGTTA CCGGCCAGGC CGGGGCCGTT GCCGGGCCGT CGGTTCCCCT GGCCTACCTG GTAATGTACC CCATCGTCAT CTGTACCGCC GTAGCCTACA TGGTCTTCTT GAGCACCCCC CTGGGCGAGC GGCCGGGCGG TGCTTACATC CACATCTCCC GCACCTTTGG CACCTACTAC CCGGGCTATA TTGCCATGTG GCTGAAATGG GTGGCTTTTA TGGGAGCCCT GGGGGTCCTC TCCCTGGGCT TCGGCCAGTA CGTCACCTTT TTCATCCCGG GAGCCAATCC CGTTCTGGTG GGAAGCCTGG TACTGCTGTT TTTCTACTTT ATAAACCTCT TCGGGGTGCG CATCTATGGC TGGGCCCAGG TGGCCATGTT CCTGGTCTTA ATGATCGCCG TGCTGGTGCT GGTAATTCCC GGCCTGCCGG CGGTAAACCT GAGCTACTAC CGCCCCCTGT TCCCCTTTGG CCTGAAAGGA TTCCTGGCCG CCATCCCGCC CCTTTTCCTG TCCTATGCCG GCTTTGAGTC CCTGGCCCAA ACGGCCGGTG AGACCAGGGA GGCGCGGCGG TCCCTGCCGC GGGTCTTCCT GGTTGGTCTT TCCATAACCG TGGTAATTTA CTTTGCCATG TCCTTTGTCG CCTTTGGCAA CCTTCCATAC CAGCAGTTGG CCCATTCCCG GTCGGCCATG GCCGATGTGG CCGCCAGGTA CCTGCCCTTT GGGGCAGCAG CCATTGTCGC CGTGGGGGCC ATGATGGCCT TCACCACCTC CATTAACGGC ACCTTAATGG TACCGCCGCG GGTACTGATG GTCCTGGCCG AGGACCGGAT GATACCCGGG TTCCTGGCCC ATATCAACCC CCGTTTCCGG ACGCCGGATG TGGCCCTGAC CATCAGTACA GGCGTGGCCC TGGCGCTCCT GTGGACGAAA ACCCTTGACT ACATTCTGGC CGTCACCCTC CAGGCCATGT TTATCCTCTA TATTGTTCAC GGCATAGCCC TGATCTGCCT GCCCTTTGTG AACCCGCGCC TTTACAAAAC GGCGCTGGTG CGCCTCCATC CCGCCCTCCT GGTCATTAGC GGCTTGATAT CTATTGCTGC TATGCTTCTC TTTAGCTATG CCATGATTAT TGCCGCCTGG CGGCTGCTGC TCCTCTGGGT GGCTGTCGGT ACGGGGATCT ACCTGTATTC CCGTTACCAG GGGCGCCGGG AGGGTTTCGA TTACCAGCGC CGCCTGGTAG AAGAATGGTG TGACGAAGCT GAGGCATAA
|
Protein sequence | MSIAAARTRS QAGQKVGLRR DLGIWESYAT LIGVLIGSGI FVVTGQAGAV AGPSVPLAYL VMYPIVICTA VAYMVFLSTP LGERPGGAYI HISRTFGTYY PGYIAMWLKW VAFMGALGVL SLGFGQYVTF FIPGANPVLV GSLVLLFFYF INLFGVRIYG WAQVAMFLVL MIAVLVLVIP GLPAVNLSYY RPLFPFGLKG FLAAIPPLFL SYAGFESLAQ TAGETREARR SLPRVFLVGL SITVVIYFAM SFVAFGNLPY QQLAHSRSAM ADVAARYLPF GAAAIVAVGA MMAFTTSING TLMVPPRVLM VLAEDRMIPG FLAHINPRFR TPDVALTIST GVALALLWTK TLDYILAVTL QAMFILYIVH GIALICLPFV NPRLYKTALV RLHPALLVIS GLISIAAMLL FSYAMIIAAW RLLLLWVAVG TGIYLYSRYQ GRREGFDYQR RLVEEWCDEA EA
|
| |