Gene Moth_1515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1515 
Symbol 
ID3831980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1560632 
End bp1561840 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content67% 
IMG OID637829447 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_430367 
Protein GI83590358 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGAGG TTAAAGTCCT GGCGGTGAGG GAACTCACCG CCTATTTGCA GCGCCTCCTG 
GCCAACGACG GCCGCCTGGC CAATGTCTGG GTAAAGGGCG AGATCTCCAA CCTGCGCTCT
CCAACCTCGG GCCATCTCTA TTTCAGTCTC AAGGACCAGG CTGCGACCCT GCGCTGCGTG
ATGTTTCAGG GCCGTAGCCG GGGCCTATCC CTGGGCCTGC GCGACGGCCT GGAGGTAATC
GCCAGGGGTC AGGTGGCCAT TTACCCCCGG GACGGCGTCT ACCAGCTCTA TGTAGCCGAA
ATCTTCCCGG CAGGGACCGG CCTGGCCAGC CTGGCCCTCC AGGAGTTGAC CGCCCGCCTG
GAACGGGAGG GTCTCTTTGC CGCCGACCGC AAGCGGCCCC TGCCCCTGTT GCCACGCCGG
GTGGGCCTGG TGACCTCCCC AACCGGGGCC GCCCTACGGG ATATGATAAC CATCAGCCGG
CGCCGCTTCC CCGGGATTGA ACTGATCCTG GCGCCGGCCA GGGTCCAGGG GGAGGTGGCG
CCCCGCCAGC TGGCCCTGGC CCTGGAACTT CTGGCAAAAA GGGGAGGCGT TGATGTCATT
ATTATCGGCC GCGGTGGCGG CTCGGCCGAA GATTTAAGCG CCTTCAACAC CGAACTGGTG
GCCAGGGCCA TCTATGCCTG TCCGGTACCC GTCATTGCTG CCGTAGGCCA TGAGACGGAT
CTCACCCTGG CCGACAGGGT AGCCGACCGG CGGGCGCCTA CTCCTTCGGC GGCGGCGGAA
ATGGCCGTAC CGGTGCGGGC AGAACTGGAA CAGCGCCTGA AGAGCCTGGC GGAGCGGGCC
CGCCGCGGTA TGGAACACCG CCTGGAGTTA GCCCGGGCTC GGCTGGAGCG CCTGACCAAA
AGCAGCGGCC TGGATCGGCC CCGCCAGGAG CTATATTACC GCCAGCAGTA CGTGGATGGC
CTGGAACAGC GCCTGCTGGC CTCCTGGGAG CGTCGTTCCC GGGAGCGGGA GCAGGGTCTT
AATCTCCTGG CTGCCCGCCT GGAAGCCGCC AGCCCCCTGG CCATCCTGGC GCGTGGCTAC
GCCGTTTGCC GCCGTCCCGG CGACGGAGCG CCCCTGAAAT CCAGCCGGGA AGTGCTCCCC
GGGGAGAAGG TGGAGGTCAT CCTGAAGGAA GGCCTTCTCC GGTGCCAGGT CGAAGAAGTC
GGCGGATAG
 
Protein sequence
MPEVKVLAVR ELTAYLQRLL ANDGRLANVW VKGEISNLRS PTSGHLYFSL KDQAATLRCV 
MFQGRSRGLS LGLRDGLEVI ARGQVAIYPR DGVYQLYVAE IFPAGTGLAS LALQELTARL
EREGLFAADR KRPLPLLPRR VGLVTSPTGA ALRDMITISR RRFPGIELIL APARVQGEVA
PRQLALALEL LAKRGGVDVI IIGRGGGSAE DLSAFNTELV ARAIYACPVP VIAAVGHETD
LTLADRVADR RAPTPSAAAE MAVPVRAELE QRLKSLAERA RRGMEHRLEL ARARLERLTK
SSGLDRPRQE LYYRQQYVDG LEQRLLASWE RRSREREQGL NLLAARLEAA SPLAILARGY
AVCRRPGDGA PLKSSREVLP GEKVEVILKE GLLRCQVEEV GG