Gene Moth_2440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2440 
SymbolsecY 
ID3831670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2556605 
End bp2557867 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content52% 
IMG OID637830359 
Productpreprotein translocase subunit SecY 
Protein accessionYP_431265 
Protein GI83591256 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit
[TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTAGACA CCCTGGCCAG CGCCTGGAAA CTGGAAGACC TGCGGAAGAA GATTTTTTTC 
ACTTTGCTCA TGTTTGTAGT CTTTCGCCTG GGTGCCCATG TTCCGGTACC GGGTATCAAC
AATGCCATTT TAAAGGAATT GATTGGTACC GGGACGATCT TCGGGTTTTT TGACGTAATT
TCCGGCGGGG CTTTTAAGCG CTTTACCATC TTTGCCATGG GCATTATGCC CTACATCAAT
GCCTCGATTA TCATGCAGCT CCTGACAGTG GTCATCCCGG CCCTGGAGCG CCTGGCCAAG
GAGGATATTG AGGGCCGAAA AAAAATCGTC CAGTATACAC GCTACGGGAC AGTTATTTTA
AGTATACTCC AGGCCCTGGG CATGGGCCTG TACCTGGCTC GCTCCCATGC CTTCCTGCGG
CCGGGCCTTT ATAACTACCT GGTGGTAGTT ATAATGCTTA CAGCGGGAAC GACTTTTCTT
ATGTGGATGG GCGAACAGAT TACCGAAAAG GGTATCGGCA ATGGCATCTC CCTGATCATC
TTTGCCGGTA TAGTATCGCG CCTGCCGGCA GGGGCGGCCA GCCTCTACCA GTACGTTACC
TCAGGAACGG TCAATATTAT TTCCCTGCTT GTCTTTGCCA TTGTGGCTGT GCTTATTATA
GCTGCCGTGG TGGCAGTACA GGAAGGGGAA CGCCGGATTG CCGTCCAGTA TGCCAAACGG
GTGGTGGGCC GGCGTGTCTA TGGTGGCCAG AGCACCCATA TACCCCTGAA AGTCAATCAG
GCAGGGGTTA TTCCCGTAAT CTTTGCCATG TCCATCCTGC TCTTTCCCAG TACCCTGGCG
TCCTGGTTTC CCCAGAGCAG TTTGGCCCAG ACAATAGTCC GGTTCTTCGA TCCCCGGTCG
GCTTTCTATA TGATCCTGTA TGCCCTGTTA ATTATCTTCT TTACCTATTT TTATACGGCT
GTGACCTTTA ACCCCCAGGA CGTGGCCGAT AATATGAAGA AATATGGTGG TTTTATACCG
GGCTTAAGAC CAGGGCGTCC TACGGCCGAG TATATTGAAC GGATCCTGGC CCGGGTAACC
CTGGCCGGGG CTATTTTCCT GGCGTTTATT GCCGTACTGC CCAATCTTCT CATGGCTATC
ACCGGGATCA ACGTCTATTT CGGCGGCACT TCCCTGCTGA TTGTCGTGGG TGTGGCACTG
GAAACCATGA AACAACTGGA ATCCCACCTG TTGTTGCGGC ACTACCAGGG CTTTATGAAA
TAA
 
Protein sequence
MLDTLASAWK LEDLRKKIFF TLLMFVVFRL GAHVPVPGIN NAILKELIGT GTIFGFFDVI 
SGGAFKRFTI FAMGIMPYIN ASIIMQLLTV VIPALERLAK EDIEGRKKIV QYTRYGTVIL
SILQALGMGL YLARSHAFLR PGLYNYLVVV IMLTAGTTFL MWMGEQITEK GIGNGISLII
FAGIVSRLPA GAASLYQYVT SGTVNIISLL VFAIVAVLII AAVVAVQEGE RRIAVQYAKR
VVGRRVYGGQ STHIPLKVNQ AGVIPVIFAM SILLFPSTLA SWFPQSSLAQ TIVRFFDPRS
AFYMILYALL IIFFTYFYTA VTFNPQDVAD NMKKYGGFIP GLRPGRPTAE YIERILARVT
LAGAIFLAFI AVLPNLLMAI TGINVYFGGT SLLIVVGVAL ETMKQLESHL LLRHYQGFMK