Gene Moth_0156 details

Gene Information

Locus tag: Moth_0156
Symbol:
ID: 3831868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 156365
End bp: 157753
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 60%
IMG OID: 637828095
Product: amino acid permease-associated region
Protein accession: YP_429037
Protein GI: 83589028
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1113] Gamma-aminobutyrate permease and related permeases
TIGRFAM ID:
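
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 156365
end_bp = 157753
gene_length_bp = 1389
protein_length_aa = 462

# Assumption: both coordinates are inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons. The last codon is
# assumed to be the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)  # prints: 1389 0 462
```

Under these assumptions the computed values agree with the listed Gene Length (1389 bp) and Protein Length (462 aa).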


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.041937
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATAG CGGCAGCCAG GACCCGGAGC CAGGCCGGGC AAAAAGTGGG CCTGCGGCGG 
GATTTAGGGA TCTGGGAGAG CTATGCTACC TTAATTGGCG TCCTCATAGG TTCGGGCATT
TTCGTGGTTA CCGGCCAGGC CGGGGCCGTT GCCGGGCCGT CGGTTCCCCT GGCCTACCTG
GTAATGTACC CCATCGTCAT CTGTACCGCC GTAGCCTACA TGGTCTTCTT GAGCACCCCC
CTGGGCGAGC GGCCGGGCGG TGCTTACATC CACATCTCCC GCACCTTTGG CACCTACTAC
CCGGGCTATA TTGCCATGTG GCTGAAATGG GTGGCTTTTA TGGGAGCCCT GGGGGTCCTC
TCCCTGGGCT TCGGCCAGTA CGTCACCTTT TTCATCCCGG GAGCCAATCC CGTTCTGGTG
GGAAGCCTGG TACTGCTGTT TTTCTACTTT ATAAACCTCT TCGGGGTGCG CATCTATGGC
TGGGCCCAGG TGGCCATGTT CCTGGTCTTA ATGATCGCCG TGCTGGTGCT GGTAATTCCC
GGCCTGCCGG CGGTAAACCT GAGCTACTAC CGCCCCCTGT TCCCCTTTGG CCTGAAAGGA
TTCCTGGCCG CCATCCCGCC CCTTTTCCTG TCCTATGCCG GCTTTGAGTC CCTGGCCCAA
ACGGCCGGTG AGACCAGGGA GGCGCGGCGG TCCCTGCCGC GGGTCTTCCT GGTTGGTCTT
TCCATAACCG TGGTAATTTA CTTTGCCATG TCCTTTGTCG CCTTTGGCAA CCTTCCATAC
CAGCAGTTGG CCCATTCCCG GTCGGCCATG GCCGATGTGG CCGCCAGGTA CCTGCCCTTT
GGGGCAGCAG CCATTGTCGC CGTGGGGGCC ATGATGGCCT TCACCACCTC CATTAACGGC
ACCTTAATGG TACCGCCGCG GGTACTGATG GTCCTGGCCG AGGACCGGAT GATACCCGGG
TTCCTGGCCC ATATCAACCC CCGTTTCCGG ACGCCGGATG TGGCCCTGAC CATCAGTACA
GGCGTGGCCC TGGCGCTCCT GTGGACGAAA ACCCTTGACT ACATTCTGGC CGTCACCCTC
CAGGCCATGT TTATCCTCTA TATTGTTCAC GGCATAGCCC TGATCTGCCT GCCCTTTGTG
AACCCGCGCC TTTACAAAAC GGCGCTGGTG CGCCTCCATC CCGCCCTCCT GGTCATTAGC
GGCTTGATAT CTATTGCTGC TATGCTTCTC TTTAGCTATG CCATGATTAT TGCCGCCTGG
CGGCTGCTGC TCCTCTGGGT GGCTGTCGGT ACGGGGATCT ACCTGTATTC CCGTTACCAG
GGGCGCCGGG AGGGTTTCGA TTACCAGCGC CGCCTGGTAG AAGAATGGTG TGACGAAGCT
GAGGCATAA
 
Protein sequence
MSIAAARTRS QAGQKVGLRR DLGIWESYAT LIGVLIGSGI FVVTGQAGAV AGPSVPLAYL 
VMYPIVICTA VAYMVFLSTP LGERPGGAYI HISRTFGTYY PGYIAMWLKW VAFMGALGVL
SLGFGQYVTF FIPGANPVLV GSLVLLFFYF INLFGVRIYG WAQVAMFLVL MIAVLVLVIP
GLPAVNLSYY RPLFPFGLKG FLAAIPPLFL SYAGFESLAQ TAGETREARR SLPRVFLVGL
SITVVIYFAM SFVAFGNLPY QQLAHSRSAM ADVAARYLPF GAAAIVAVGA MMAFTTSING
TLMVPPRVLM VLAEDRMIPG FLAHINPRFR TPDVALTIST GVALALLWTK TLDYILAVTL
QAMFILYIVH GIALICLPFV NPRLYKTALV RLHPALLVIS GLISIAAMLL FSYAMIIAAW
RLLLLWVAVG TGIYLYSRYQ GRREGFDYQR RLVEEWCDEA EA
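
The sequences above are listed in blocks of ten characters, so the spacing has to be removed before they can be analyzed. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last lines of the gene sequence are included to keep the example short; pasting all lines of the listing would give the figure for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First and last lines of the gene sequence, copied from the listing.
first_line = "ATGAGTATAG CGGCAGCCAG GACCCGGAGC CAGGCCGGGC AAAAAGTGGG CCTGCGGCGG"
last_line = "GAGGCATAA"

fragment = clean_sequence(first_line)

# The CDS is expected to open with a start codon and close with a stop codon.
print("first codon:", fragment[:3])
print("last codon:", clean_sequence(last_line)[-3:])

# GC percentage of this 60-base fragment only, not of the full gene.
print("fragment GC%:", round(gc_percent(fragment), 1))
```

The listing begins with ATG and ends with TAA, which are a start codon and a stop codon in translation table 11.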