Gene Moth_1507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1507 
Symbol 
ID3831734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1551935 
End bp1553611 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content60% 
IMG OID637829439 
ProductDNA repair protein RecN 
Protein accessionYP_430359 
Protein GI83590350 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCAGG AACTGCAAAT CGAAAACCTG GCGCTAATCG AGTCCCTGCA GCTCAACCTG 
GAGCCGGGGC TGACGGTTCT TACCGGGGAA ACGGGTGCCG GCAAGTCCAT TATCGTCGAC
GCCGTGGGAC TGCTGGTAGG GGCCCGAGCC TCGGGCGAGT ATATCCGCGC CGGGGCGGAT
AAGGCGGTAG TCAGGGGCCT CTTCCAGGTA GCCGGCCTGC CGGGCCTTAA GGAAACCCTG
TCTGCCATGG GGGTGGCTGC GGAGGACGAC GGGACCCTGC TCCTTTGCCG GGAGATCAGC
CGCAGCGGTC GCCACAGCTG CCGGGTAAAC GGGCGCAGCC TGACTCTGGG TATGTACCAG
AGAATTGGCC AGATGCTGGT AGATATCCAC GGCCAGCACG CCTACCAGTC CCTCCTGCGG
CCGGCTTATC AAATGGATAT GCTGGATAGC CTGGCCGGCC TGATGGAGCT CAGGCAGGAG
GTCGGAGAGT TATATACCAG GTGGCAGGAT TTAAAAAAGG AACTGGAGGA ACTCTGCGGC
GACAGGGGGG AGCGGGAACG GCAACGTGAT CTCTGGCAAT ACCAGCTCCA GGAAATAGGC
GCCGCCAACC TTACCCCCGG GGAAGAGGAA GAATTGAGCC GCCAGCGGGA AATCCTCAAT
AACGGGGAAA GGCTGGCCAG GGGTGCGGCG GTCGTTTATG CTGCCCTCTT TGAGGAAGAA
GGCAGATCGG CCTACGATCA GCTCAGCCGG GCCTTGGCGG AACTTGAGGC CCTGGCAGCC
ATTGATCCCG GCTTGCAGAC CTGGCAGGGC ACCCTGGAGG GGATAACGGC CCAGGTCGAA
GAGATAGCCC GGAGCGTTCG CCGCTATGGG GAAGGGCTGG AGTACGATCC TGCCCGCCTG
CAGGAGATTG AAAACCGTCT GGAACTGATA AAAGATTTAA AACGCAAGTA CGGCGGCAGT
ATCGAGGCCA TTTTACAGTA CCAGGCGGAA ACGGCGGCCG CCCTGGAAAG GCTTGAGCAA
ATGGCGGGAC AGGCGGCCGC CCTGGAAAAG GAGATCGAAC TGGCCGCGGA AAAATACAGG
GAAAAAGCCT TGCTTTTACG CCGGCGGCGT ATGGAAGCAG CGCAAAAAAT CGAAAAAGAG
CTGCTGAAAG TCCTTAAGGA TCTGGCCATG CCGGCAGCCA GGATCAGGGT GGACTGCAGC
GAAGCACCAC AGCCAGGCCC AGCGGGTATG GACAACATTA CATTTCTTTT TCAACCCAAT
CCCGGAGAGG GCAGCCGGCC CCTGGCCCAG ATAGCTTCCG GAGGGGAGAT GGCCCGGGTA
ATGCTAGCCT TAAAGAGTAT TCTGGCCGAT GTTGACGCCG TGCCAACATT GATATTTGAT
GAGATTGACG CCGGCGTTGG CGGTGCTGCG GCCCGGGCGG TGGCCAGGAC CCTGGCGGCC
ATAGGCCGCC GGCGACAGGT CCTTTGCATT ACCCACTCGG CCCAGCTGGC CAGTTTTGCC
GGGCAGCACT TCCGGGTTAG CAAGGAGGTG CGGGAAGGCC GCACCTATAC CCGGGTCGAT
GTTCTCCGGG GTGAAGAGCG GGTTGATGAG CTGGCCCGGT TATTATCCGG TTCCGCCAGC
AGTGTGGCGC GAGAACATGC CGCTGCCCTT TTACAACAAT CAGGGTCCAT AAAATGA
 
Protein sequence
MLQELQIENL ALIESLQLNL EPGLTVLTGE TGAGKSIIVD AVGLLVGARA SGEYIRAGAD 
KAVVRGLFQV AGLPGLKETL SAMGVAAEDD GTLLLCREIS RSGRHSCRVN GRSLTLGMYQ
RIGQMLVDIH GQHAYQSLLR PAYQMDMLDS LAGLMELRQE VGELYTRWQD LKKELEELCG
DRGERERQRD LWQYQLQEIG AANLTPGEEE ELSRQREILN NGERLARGAA VVYAALFEEE
GRSAYDQLSR ALAELEALAA IDPGLQTWQG TLEGITAQVE EIARSVRRYG EGLEYDPARL
QEIENRLELI KDLKRKYGGS IEAILQYQAE TAAALERLEQ MAGQAAALEK EIELAAEKYR
EKALLLRRRR MEAAQKIEKE LLKVLKDLAM PAARIRVDCS EAPQPGPAGM DNITFLFQPN
PGEGSRPLAQ IASGGEMARV MLALKSILAD VDAVPTLIFD EIDAGVGGAA ARAVARTLAA
IGRRRQVLCI THSAQLASFA GQHFRVSKEV REGRTYTRVD VLRGEERVDE LARLLSGSAS
SVAREHAAAL LQQSGSIK