Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0229 |
Symbol | secA |
ID | 3832557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 225695 |
End bp | 228385 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828165 |
Product | preprotein translocase subunit SecA |
Protein accession | YP_429107 |
Protein GI | 83589098 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase) |
TIGRFAM ID | [TIGR00963] preprotein translocase, SecA subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0140186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCTGG GTATATTGCG TAATCTCCTT GACGATAATG CCCGGGATAT AAAGAAATTG AGCCGCCAGG TAGAGGCCAT CAACGCCCTG GAACCGGAGA TTCAGGCCCT AAGCGACAGC GACCTCCAGG CCAAAACCCC GGAGTTCCGT CGCCGCCTGG AGAGGGGCGA GACCTTGGAC GAGCTGCTGC CGGAGGCCTT TGCCGTTGTC CGGGAGGCCT CCCGGAGGGT GCTGGGGATG CGCCACTTTG ACGTCCAGCT TATGGGCGGC ATCGTCCTCC ACCAGGGACG CATCGCCGAG ATGAAGACCG GCGAAGGCAA AACCCTGGTG GCTACCCTGC CGGCTTACCT CAATGCCCTG ACGGGCAGAG GGGTGCATAT TGTTACCGTC AATGACTACC TGGCCAGACG GGACAGCGAA TGGATGGGTC GTATCTACCG TTTCCTGGGG CTGAAGGTGG GCCTGATCGT CCACGGCCTG GACGCCGCCG AGCGGCGGGA AGCGTATAAC GCCGATGTTA CCTACGGCAC CAACAACGAG TTCGGCTTCG ATTACCTGCG GGATAATATG GCCCTGCACC CGGAGGAGAT GGTCCAGCGG GAGCTCAACT ACGCCATCGT CGACGAGGTC GACAGTATCC TAATCGATGA AGCCCGGACG CCGCTCATTA TCTCCGGTAT GGCCGAAAAA CCCACGGAGA TGTATTACAC GGTGGCTGCC ATTATCCCGC GCCTGCAGCC GAACATTGAT TATAATGTCG ACGAAAAGGC CAAGGTGGCG ACCCTGACGG AGGCCGGGGT AGCCAAGGTG GAAAAGATGC TGGGGGTCGA CAACCTCTAC GACGACGCCA ATATGGAACT GGCCCACCAC GTTAACCAGG CCTTGAAGGC CCATACCCTG ATGAAGCGCG ACCGGGACTA TGTGGTCAAA GACGGCCAGG TCATCATCGT CGACGAGTTC ACCGGCCGCC TGATGTTCGG CCGCCGCTAC AGCGAAGGGT TGCACCAGGC CATCGAGGCC AAGGAAGGTG TGAAGATTGA GCGGGAGTCC CAGACCCTGG CTACCATCAC CTTCCAAAAC TTCTTCCGGA TGTATGACAA GCTGGCCGGC ATGACCGGCA CGGCGGCTAC CGAGGAAGAA GAGTTTCGCA AGATCTACAA CCTGGATGTG GTGGTTATCC CCACCAATAA GCCCATGATC CGCAAGGATT ACCCTGACGT CGTCTACCGG ACGGAAAAGG GTAAGTTCGA GGCAGTTGTC GAGGAGATCC GGGAACGCCA TGCAAAGGGC CAGCCGGTGC TGGTGGGCAC CATCTCCATC GAAAAGTCCG AGCGCCTGAG TGAAATGCTT AAAAAACGGG GGATCCCCCA CCAGGTGTTG AACGCCAAAT ACCACGAAAA GGAAGCGGAG ATTATCGCCC AGGCCGGCCG CCTCGGGGCG GTGACCATCG CCACCAACAT GGCCGGTCGC GGCACGGATA TTATCCTGGG AGGCAACCCT GAGGCCCTGG CCAAAGAAAA GATGCGCCAG AAGGGCTACA GTCCGGAAAT AATCGCCGCG GCTACGGCGA TCAAGGTCGA CGAGGGGGAC CCGGAGGTCA TGGCCGCCCG CCAGGACTAC CAGCAGTTCC TGGCTGCCGC CCGGCGGGAG ACGGAAGAGG AGCACCGGCG GGTGGTGGAA CTTGGCGGCC TGCATATCAT CGGCACCGAG CGCCACGAGA GCCGGCGGAT CGATAACCAG CTTCGCGGTC GTGCCGGGCG CCAGGGAGAC CCGGGGTCCA GCCGCTTCTA TGTCTCCCTG GAAGATGATT TGATGCGCCT CTTCGGCTCC GACAACCTGA CCGGTATCCT GGACCGTCTG GGTATGGACG ACAGCACCCC CATCGACCAT CCCCTGGTGT CCCGTTCCCT GGAGCAGGCC CAGAAAAAGG TCGAGGCCCA TAACTTTGAC ATCCGCAAGC ACGTCCTGGA ATATGACGAC GTCATGAATA AGCAGCGGGA GATTATCTAC CGCCAGCGGC GGGAGGTGCT TACGGGGGCC GACATCCGGC CGACTATCGA AGACATGATC AAAACCGTCG TCGACCAGAC GGTGGACCGC TTCGCCGGTG AGAGCAAGTA CCCGGAGGAG TGGGACCTGG CCGGGCTGCT CGATTATGCC GAGCAGTTGT TCCTGCCCGG GTGCGACCGC GAGGCCCTGG CCAACGCCAT CAAGGAGATG GAAAAGGAAG AGGTCTACGG CTTCCTGCGG GAAAAGGCTA TGGAGGCCTA CCGGCAGCGA GAAGAAGAAC TGGGCCCCGA AACCCTCAGG GAGATCGAAC GCCTGATTCT CCTCCGGGTG GTCGATACCA AGTGGATGGA CCACCTGGAC GCCATGGATC AGCTGCGCCA GGGCATCGGC CTCCGGGCCT ACGGCCAGCA GGATCCCCTG GTGGCCTATA AATTTGAGGC CTACCAGATG TTTAACGATA TGATCGCCTC CATCCAGGAA GACGTGGTGC GCTATCTGTA CCGGGTGAAG GTCGTCCAGC CGCAAGCGGA GCGTCCCCGC CAGGTGGTGG AGAACCGCTA TGCCGGTGAG GCTTCCGGTC CCCGGCAGCC GGTGCGCCGG GAGCATAAGG TAGGCCGCAA CGATCCCTGT CCCTGCGGCA GCGGCAAGAA GTACAAGAAG TGCTGCGGCG CCGGGAAGTA G
|
Protein sequence | MVLGILRNLL DDNARDIKKL SRQVEAINAL EPEIQALSDS DLQAKTPEFR RRLERGETLD ELLPEAFAVV REASRRVLGM RHFDVQLMGG IVLHQGRIAE MKTGEGKTLV ATLPAYLNAL TGRGVHIVTV NDYLARRDSE WMGRIYRFLG LKVGLIVHGL DAAERREAYN ADVTYGTNNE FGFDYLRDNM ALHPEEMVQR ELNYAIVDEV DSILIDEART PLIISGMAEK PTEMYYTVAA IIPRLQPNID YNVDEKAKVA TLTEAGVAKV EKMLGVDNLY DDANMELAHH VNQALKAHTL MKRDRDYVVK DGQVIIVDEF TGRLMFGRRY SEGLHQAIEA KEGVKIERES QTLATITFQN FFRMYDKLAG MTGTAATEEE EFRKIYNLDV VVIPTNKPMI RKDYPDVVYR TEKGKFEAVV EEIRERHAKG QPVLVGTISI EKSERLSEML KKRGIPHQVL NAKYHEKEAE IIAQAGRLGA VTIATNMAGR GTDIILGGNP EALAKEKMRQ KGYSPEIIAA ATAIKVDEGD PEVMAARQDY QQFLAAARRE TEEEHRRVVE LGGLHIIGTE RHESRRIDNQ LRGRAGRQGD PGSSRFYVSL EDDLMRLFGS DNLTGILDRL GMDDSTPIDH PLVSRSLEQA QKKVEAHNFD IRKHVLEYDD VMNKQREIIY RQRREVLTGA DIRPTIEDMI KTVVDQTVDR FAGESKYPEE WDLAGLLDYA EQLFLPGCDR EALANAIKEM EKEEVYGFLR EKAMEAYRQR EEELGPETLR EIERLILLRV VDTKWMDHLD AMDQLRQGIG LRAYGQQDPL VAYKFEAYQM FNDMIASIQE DVVRYLYRVK VVQPQAERPR QVVENRYAGE ASGPRQPVRR EHKVGRNDPC PCGSGKKYKK CCGAGK
|
| |