Gene Moth_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1920 
Symbol 
ID3830844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1991746 
End bp1993146 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content55% 
IMG OID637829853 
Productamino acid permease-associated region 
Protein accessionYP_430763 
Protein GI83590754 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000169214 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.866406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCA GTATCTTGCG CAAAAAAGAT ATCGCTACAG CCATGGAAAT GGCTGCCATG 
GACCAATATC GGCTTCGCCG GGAGTTGAAG GCCATGGATC TCTTTTTTCT GGTAATAGGT
ATTACCATAG GTGGGGGGAT ATTCGTCCTG CCGGGGGTCA TGGCGGCGGA ACACGCAGGT
CCCGGGGTTA CTATTTCTTT TTTAATCGGC GCGGTGGTAG CCATCTTTAC CGGCCTGTGT
TATGTTGAGT TTGCTTCTAT GGTCCCGGTG GCGGGAAGCG CTTATACTTA TTCTTATATT
GCCCTGGGAG AGATATTTGC CTGGATCATC GGTTGGGATG TCCTCCTGGA GTTCACCCTG
GTCTGCAGCG CCGTGGCTGT GGGCTGGTCC GGCTATATCG TTGAGTTGTT AAAGGATATG
GGCCTGAGTC TGCCCCCGGC CTTTACTACC GATATAGCCC ACGGCGGTAT AGTCAACCTG
CCGGCGGTTT TCATCCTCCT GGTGGTGGCT TATATTATTT ACGGCGGTAT CAGCCTGACA
GGTAAGGTCA ACGATGCCAT TGGGATTATA AAGCTCCTCA CCGTGGTGTT TTTTATTATC
GTGGCCCTCC CCTTTGTTAA ACCGGTAAAC TGGCAACCCT TTTTGCCCTT CGGCTGGCAA
GGGGTTATGA CTGCTGCCGC CCTGGGCTTC TTTGCTTACG GTGGTTTCGA CGCCGTCACA
ACTGCCGCCG AGGAGACCCG GAACCCCAAC CGCGATATAC CCTTAGGCCT GATCCTGGGA
CTGGTGGTAG TGGCTTCTCT TTATGTTCTT GTCTCCCTGG TGCTGACGGG GGTTATTCCT
TACACCAAAC TCGATACCCC GGCACCTGTG GCTTTTGCCC TCTCCTACCT GGGCAAACGC
TGGGGCGGGA GTCTGGTAGC CGCCGGGGCC ATCTGCGGCC TTTTTACAGT TATGATGGGG
GCTATGCTGG GTGGGAGCCG CATCCTGTTC GCCCTCAGCC GCGACGGTCT ATTGCCGCCG
GTTTTTTCCC GGGTACACGC AACCAGGCGT ACTCCCTACG TTGCCACATT GATCGTCCTG
ACAGTGGCCG TCCTGACAGG CGGTTTCCTC TCCCTGGGAG AATTGGTGGA ACTGGTGAAT
ATCGGCATGC TCACCGCCTA CCTCCTGACC TCTATTTCCA TCCTGGTCAT GCGCTTGAGA
TACCCGGAAA TTGAACGACC CTTCAGGGTT CCCGCCGTAT GGTTGGTGGC GCCGGTAGCC
ACCCTGGGAG TCGTGGCCCT GACCTTCAGC TTGCCAGGAG CGACGTTGGT TAGATTTGCC
ATCTGGTTTA TAGTCGGGAT GCTTATCTAC TTTGGCTATG GTATCAGGCA CTCGAAGCTG
GCTAACCGGG AAAATAATTA A
 
Protein sequence
MASSILRKKD IATAMEMAAM DQYRLRRELK AMDLFFLVIG ITIGGGIFVL PGVMAAEHAG 
PGVTISFLIG AVVAIFTGLC YVEFASMVPV AGSAYTYSYI ALGEIFAWII GWDVLLEFTL
VCSAVAVGWS GYIVELLKDM GLSLPPAFTT DIAHGGIVNL PAVFILLVVA YIIYGGISLT
GKVNDAIGII KLLTVVFFII VALPFVKPVN WQPFLPFGWQ GVMTAAALGF FAYGGFDAVT
TAAEETRNPN RDIPLGLILG LVVVASLYVL VSLVLTGVIP YTKLDTPAPV AFALSYLGKR
WGGSLVAAGA ICGLFTVMMG AMLGGSRILF ALSRDGLLPP VFSRVHATRR TPYVATLIVL
TVAVLTGGFL SLGELVELVN IGMLTAYLLT SISILVMRLR YPEIERPFRV PAVWLVAPVA
TLGVVALTFS LPGATLVRFA IWFIVGMLIY FGYGIRHSKL ANRENN