Gene Moth_0256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0256 
SymboluvrC 
ID3833219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp262415 
End bp264256 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content61% 
IMG OID637828192 
Productexcinuclease ABC subunit C 
Protein accessionYP_429134 
Protein GI83589125 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTGG AGGAGAAACT GGCGCGCCTG CCGGACCACC CCGGCGTATA TATAATGCAC 
GATGCCAACG GGGCAATAAT CTATGTCGGC AAGGCGGCTT CCCTGAGGAA TCGGGTGCGT
TCCTACTTCC GCGGCCAGCA CCAGCCGCGG ACGCAAGCCA TGGTCAGCCA CGTTGCCGAC
TTTGAGTATA TCTTGACGGA CAACGAAGTC GAGGCCCTGA TCCTGGAGTG CAACCTGATC
AAACAACACC GGCCGCGGTA TAACGTCAGC CTGAAGGACG ACAAGAGTTA CCCCTATATC
AAGATAACCA CCCAGGAGGA TTTTCCCCGG ATCCAGATTA CGCGTTCCGT GACCCGTGAC
GGTTCCCGTT ACTTCGGACC TTATACCAGC GCCGGTTCCC TGAAAGAAAC CCTGAAGCTC
CTGCGCGGCC TTTTTCCCAT CCGGACCTGC AGGGATACCC CCCTGCAACC CCGCAGCCGT
CCCTGCCTCA ACGCCCATAT CGGCCGCTGC CTGGCCCCCT GTGCCGGCCA GGTCGACCGG
GAGACCTACC GGGAGGCGGT CGATAATGTC ATTATGTTCC TGGAAGGCAG GCATACGGCC
CTGGTTAAGG AGCTGAAGGA GCAAATGGAA GCCGCCGCCG CGAGACTGGA GTTTGAAAAG
GCGGCCAGGC TCCGGGACCA GCTCCGGGCG GTACAGGAGG TCTGTGAAAA GCAGAAACTG
GCCGCCGCCA GCGGGGAAGA CGCCGACGCC ATCGCCTTCG CCCGGGAAGG GGAGGCTGCC
CTGGGGCTCA TCTTTTTTAG CCGGGGCGGC AAGGTAATCG GCCGGGATCA CTTCTTCCTA
ACAGGGAGCG AAGGGTTATC CCGGGGGGAG GTTATGGCGG CCCTGCTAAA AGAGTATTAT
AGCCGGGGAG TAGAGATACC GCCGGAGATC CTCCTCCACG ACGAACCGGA GGATGCCGCC
ACCATCGCCA GCTGGTTGAG CCGGCTCCGT GGCGGCAGGG TTAACCTGCG GGTGCCCAAA
AGGGGTACGA AATTAAAACT CCTCCGGCTG GTTCACGAGA ACGCCGTAAG CCTCCTCCAG
GAGCACCTGC TGACCCGCCG GCGCCAGGAG GAGGGCAGCA GGGCGGCCCT CCTGGAACTC
CAGGAAATCC TGGAGTTACC GCGCTTGCCG CGGCGGATGG AGGCCTACGA TATCTCTAAC
TTCCAGGGGA GCTCCCAGGT GGGAGCTATG GCCGTCTTTG TTGACGGCCG GCCGCTGCCT
TCGGCGTACC GCCGGTTTCA GATTAAGACT GTCCGGGGGC CCAACGACTT CGCTTCCCTG
CAGGAGGTTT TGAGCCGTCG TTTCCGGCGG GCTGCCGAAC AGGACCCCCA TTTTGCCGAT
TTGCCGGATT TCGTCCTGAT TGACGGCGGC CTGGGCCAGC TCCACGCCGC CCGGGAGACC
ATGGAAGCCA TGGGGGTAGG GTATATTCCC ACCTTTGGCC TGGCCAAGGA GGAGGAACTG
TTATTCCGGG TGGGCACCTC CGAGCCCATC CGCCTGCCCC GTGAGAGCAA GGCCCTGCAA
ATCCTGCAAC ACCTCCGGGA TGAGGTGCAC CGCTTTGCCA TCACCTATCA CCGGCAAAAG
CGGGAAAAGA CAGCCTATCG CTCGGTCCTG GACGACATTC CCGGCGTAGG CCCCAAGCGT
AAGAAGGCAT TATTACGTCA TTTTGGTTCC GTAGCAGCCA TCAGCAAAGC GACGCTGGAA
GATTTACTGG CCGTAGAGGG GATGAACCGG ACCGTGGCGG CCCGCATCCT GGCCGGCCTG
GGGAGGAGAA GTGATGGGGA AGATTCGACT GGTAGCCCTT GA
 
Protein sequence
MDLEEKLARL PDHPGVYIMH DANGAIIYVG KAASLRNRVR SYFRGQHQPR TQAMVSHVAD 
FEYILTDNEV EALILECNLI KQHRPRYNVS LKDDKSYPYI KITTQEDFPR IQITRSVTRD
GSRYFGPYTS AGSLKETLKL LRGLFPIRTC RDTPLQPRSR PCLNAHIGRC LAPCAGQVDR
ETYREAVDNV IMFLEGRHTA LVKELKEQME AAAARLEFEK AARLRDQLRA VQEVCEKQKL
AAASGEDADA IAFAREGEAA LGLIFFSRGG KVIGRDHFFL TGSEGLSRGE VMAALLKEYY
SRGVEIPPEI LLHDEPEDAA TIASWLSRLR GGRVNLRVPK RGTKLKLLRL VHENAVSLLQ
EHLLTRRRQE EGSRAALLEL QEILELPRLP RRMEAYDISN FQGSSQVGAM AVFVDGRPLP
SAYRRFQIKT VRGPNDFASL QEVLSRRFRR AAEQDPHFAD LPDFVLIDGG LGQLHAARET
MEAMGVGYIP TFGLAKEEEL LFRVGTSEPI RLPRESKALQ ILQHLRDEVH RFAITYHRQK
REKTAYRSVL DDIPGVGPKR KKALLRHFGS VAAISKATLE DLLAVEGMNR TVAARILAGL
GRRSDGEDST GSP