Gene Moth_2225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2225 
Symbol 
ID3830832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2319955 
End bp2321154 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content55% 
IMG OID637830145 
Producthypothetical protein 
Protein accessionYP_431055 
Protein GI83591046 
COG category[R] General function prediction only 
COG ID[COG1287] Uncharacterized membrane protein, required for N-linked glycosylation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.526626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000291502 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTTGTTTA AAGTAAGACG GGAAGCGATC AACACCCTGG CGGTAACCGT GGCCCTGGTC 
CTGATTGTCA CCCTGGCCCT TTATCTCAGG TTAAAGTTCG TTTTTACCAT CGACCATCCA
CCCTTGAAGG GCTTTTCCTC CGACGCTGTC AACTATGACC TCATGGCGCG CCAGTTCCTG
GATAAGGGGT TTCTGGGCTA TATGTCCAGC CGGCCCAATG CCTATATCAC CCCGGGCTAT
CCCCTGTTCC TGGCTTTGAT TTACAAACTC TACGGTTATG CCCAGGGGAG TCCCCTGCAG
GCGGTGCGGG TGGTCCAGGC CTGCCTGGGC ACCCTGACGG TGGTGCTGTT GTACCTGGCG
GGCCGGGAGG TAAAAAACAC CAGGGTGGGG CTGGTGGCCG CCCTGCTGGC GGCTATTTAC
CCCACCTTTG TCTGGGCCCC GACTATCCCT TTAACCGAAG TAGTTTATAC CTTCTTTTTT
ATGCTCTATT TTTACCTCCA GCTCCGGTAT TTACGCCATC CTTCCCCCCT GGGAGGCGTT
TTAACGGGGC TGATTTTTGG CCTGGCCATC CTGGTTCGCC CGGCGGCGGC CCCCCTGATC
GTGGTACCCT TCCTTTATGA TTTTTACCGG CGTAAGGAGT GGCGTTCCTC CCTCAAGGGT
TTCCTGTACA CCCTGGGAGG GTTTGTGGCC GTCATGCTGC CCTGGTGGAT ACGCAACCTG
GTAACCCTGC ACCAGTTCAT CCTCCTGGCC ACCCAGACCT GGAACCCCCT CCTTTACGGC
GCCTTTCCTT ACTTCACAGA TATGGACAAG GTGCCTCCCA TCCAGTCCAC CCAGGAGGCC
TTGCATTTTA TCCTCCGGGG CTTTTTAAGG AACCCGGTGT TGTACCTCAA GTGGTATACA
ATCGGCAAGT GGCAGGTTAT CTTTGGCAAT ATGTGGTACG GTCTTGACCT TTCCCGCTAT
CAGTACTTGC GTTCGGTTTA CTGGGTGCAC AATTTTATCA CCATGGTGGG CTGGTTGGGG
TCTTTTAAGG CCCTCAAGGA GGGAAGAGTA GGCCTGGTGG CCATCTTTAT TTTTCTCCTG
ACGGCCATCC AATTGATGTT TATCCCCACC GTCAGGTATG CCTTTACCAT CATGCCGTTC
TTGATGCTCA CCACCGCTTG GCTTATGGAT CTCCTGTTCG GAGCGGAAGA GGCTGCTTGA
 
Protein sequence
MLFKVRREAI NTLAVTVALV LIVTLALYLR LKFVFTIDHP PLKGFSSDAV NYDLMARQFL 
DKGFLGYMSS RPNAYITPGY PLFLALIYKL YGYAQGSPLQ AVRVVQACLG TLTVVLLYLA
GREVKNTRVG LVAALLAAIY PTFVWAPTIP LTEVVYTFFF MLYFYLQLRY LRHPSPLGGV
LTGLIFGLAI LVRPAAAPLI VVPFLYDFYR RKEWRSSLKG FLYTLGGFVA VMLPWWIRNL
VTLHQFILLA TQTWNPLLYG AFPYFTDMDK VPPIQSTQEA LHFILRGFLR NPVLYLKWYT
IGKWQVIFGN MWYGLDLSRY QYLRSVYWVH NFITMVGWLG SFKALKEGRV GLVAIFIFLL
TAIQLMFIPT VRYAFTIMPF LMLTTAWLMD LLFGAEEAA