Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1920 |
Symbol | |
ID | 3830844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1991746 |
End bp | 1993146 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829853 |
Product | amino acid permease-associated region |
Protein accession | YP_430763 |
Protein GI | 83590754 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000169214 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.866406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCA GTATCTTGCG CAAAAAAGAT ATCGCTACAG CCATGGAAAT GGCTGCCATG GACCAATATC GGCTTCGCCG GGAGTTGAAG GCCATGGATC TCTTTTTTCT GGTAATAGGT ATTACCATAG GTGGGGGGAT ATTCGTCCTG CCGGGGGTCA TGGCGGCGGA ACACGCAGGT CCCGGGGTTA CTATTTCTTT TTTAATCGGC GCGGTGGTAG CCATCTTTAC CGGCCTGTGT TATGTTGAGT TTGCTTCTAT GGTCCCGGTG GCGGGAAGCG CTTATACTTA TTCTTATATT GCCCTGGGAG AGATATTTGC CTGGATCATC GGTTGGGATG TCCTCCTGGA GTTCACCCTG GTCTGCAGCG CCGTGGCTGT GGGCTGGTCC GGCTATATCG TTGAGTTGTT AAAGGATATG GGCCTGAGTC TGCCCCCGGC CTTTACTACC GATATAGCCC ACGGCGGTAT AGTCAACCTG CCGGCGGTTT TCATCCTCCT GGTGGTGGCT TATATTATTT ACGGCGGTAT CAGCCTGACA GGTAAGGTCA ACGATGCCAT TGGGATTATA AAGCTCCTCA CCGTGGTGTT TTTTATTATC GTGGCCCTCC CCTTTGTTAA ACCGGTAAAC TGGCAACCCT TTTTGCCCTT CGGCTGGCAA GGGGTTATGA CTGCTGCCGC CCTGGGCTTC TTTGCTTACG GTGGTTTCGA CGCCGTCACA ACTGCCGCCG AGGAGACCCG GAACCCCAAC CGCGATATAC CCTTAGGCCT GATCCTGGGA CTGGTGGTAG TGGCTTCTCT TTATGTTCTT GTCTCCCTGG TGCTGACGGG GGTTATTCCT TACACCAAAC TCGATACCCC GGCACCTGTG GCTTTTGCCC TCTCCTACCT GGGCAAACGC TGGGGCGGGA GTCTGGTAGC CGCCGGGGCC ATCTGCGGCC TTTTTACAGT TATGATGGGG GCTATGCTGG GTGGGAGCCG CATCCTGTTC GCCCTCAGCC GCGACGGTCT ATTGCCGCCG GTTTTTTCCC GGGTACACGC AACCAGGCGT ACTCCCTACG TTGCCACATT GATCGTCCTG ACAGTGGCCG TCCTGACAGG CGGTTTCCTC TCCCTGGGAG AATTGGTGGA ACTGGTGAAT ATCGGCATGC TCACCGCCTA CCTCCTGACC TCTATTTCCA TCCTGGTCAT GCGCTTGAGA TACCCGGAAA TTGAACGACC CTTCAGGGTT CCCGCCGTAT GGTTGGTGGC GCCGGTAGCC ACCCTGGGAG TCGTGGCCCT GACCTTCAGC TTGCCAGGAG CGACGTTGGT TAGATTTGCC ATCTGGTTTA TAGTCGGGAT GCTTATCTAC TTTGGCTATG GTATCAGGCA CTCGAAGCTG GCTAACCGGG AAAATAATTA A
|
Protein sequence | MASSILRKKD IATAMEMAAM DQYRLRRELK AMDLFFLVIG ITIGGGIFVL PGVMAAEHAG PGVTISFLIG AVVAIFTGLC YVEFASMVPV AGSAYTYSYI ALGEIFAWII GWDVLLEFTL VCSAVAVGWS GYIVELLKDM GLSLPPAFTT DIAHGGIVNL PAVFILLVVA YIIYGGISLT GKVNDAIGII KLLTVVFFII VALPFVKPVN WQPFLPFGWQ GVMTAAALGF FAYGGFDAVT TAAEETRNPN RDIPLGLILG LVVVASLYVL VSLVLTGVIP YTKLDTPAPV AFALSYLGKR WGGSLVAAGA ICGLFTVMMG AMLGGSRILF ALSRDGLLPP VFSRVHATRR TPYVATLIVL TVAVLTGGFL SLGELVELVN IGMLTAYLLT SISILVMRLR YPEIERPFRV PAVWLVAPVA TLGVVALTFS LPGATLVRFA IWFIVGMLIY FGYGIRHSKL ANRENN
|
| |