Gene Moth_1928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1928 
Symbol 
ID3830852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2000725 
End bp2001747 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content48% 
IMG OID637829860 
Productspore photoproduct lyase, splB 
Protein accessionYP_430770 
Protein GI83590761 
COG category[L] Replication, recombination and repair 
COG ID[COG1533] DNA repair photolyase 
TIGRFAM ID[TIGR00620] spore photoproduct lyase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0376642 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGTTG AATTAAAACG GGTTGTCTTC GAACCGGAAG CTTTGAACTA TCCCCTGGGG 
CGAAAACTAT ATCAACGCTT CCATGAAGAA AGAGTCGAAG TATTGATGAC CCCATCTCAC
AACCGTGTTA CTTGTATCCC GGGCAAAACA GCCCGGGAGA GTTTTCTGGA AGCTAAACGT
ACCCTGGTAG TAGGGGTGCG AAGGAGCAGG GATTTTCAAA CCTGCAAGCC CTCGGCCCAT
TACCAGTTAC CCCTGGTCAC AAGTTGTCCG GCCATGTGTG AATACTGTTA CCTGTTTACT
CATTTTGGGC GTAAGCCCTA TCAAAAGATT TATGTTAATA TCGCTGAAAT CCTCGCTCTG
GCCCGGGATT ATATTAACCG GCGCGACCCC GAAGTAACAT ACTTTGAAGC TTCAGCCACC
TCAGATCCCG TGCCGGTAGA AAAGTATACC GGCAGCCTTG CCGCTGCTAT TGAATTTATG
GCCAGGCAAC CCCTGGGGCG CTTGCGGGTT GCTACTAAAT TTACTGATGT AGACGGGCTA
TTAAACCTGG ACCACCGGGG CCATACCCGC TTTCGCTTTA GTATCAATGC AGAAAACATA
ATTAAGCGTT TTGAGCATGG TACCCCGCCC CTGGGGCAAC GGCTGGCGGC GGCGGCACAG
ATGGCCGGGG GAGGTTACCT GACAGGTTTC ATTATTGCTC CTATATTTTA TTTGGAAGGG
TGGCAGCAGC AATATCGCCA TCTTTTTCAA GTAATTGCCA GGCAACCGTT GCTTGCCGAA
TCGAATGACC TGACATTAGA GCTAATAACC CATCGCTTTA CCAAACGGGC CAAGACTTCC
ATTGAAACGC TATTTCCCAA TACAAAATTG CCTCTAGATG AAGAAGAGAG GACTTTTCGG
TATGGTCAAT TTGGTTATGG TAAGTATGTT TACCCCGCTG AAGTAAGAAC GGCGCTGGAG
GCTTTTTTCA AAGAAATGGT AGCTACTTAT TTACCGAGGG CTAAAGTTGA GTATTTTATA
TAG
 
Protein sequence
MTVELKRVVF EPEALNYPLG RKLYQRFHEE RVEVLMTPSH NRVTCIPGKT ARESFLEAKR 
TLVVGVRRSR DFQTCKPSAH YQLPLVTSCP AMCEYCYLFT HFGRKPYQKI YVNIAEILAL
ARDYINRRDP EVTYFEASAT SDPVPVEKYT GSLAAAIEFM ARQPLGRLRV ATKFTDVDGL
LNLDHRGHTR FRFSINAENI IKRFEHGTPP LGQRLAAAAQ MAGGGYLTGF IIAPIFYLEG
WQQQYRHLFQ VIARQPLLAE SNDLTLELIT HRFTKRAKTS IETLFPNTKL PLDEEERTFR
YGQFGYGKYV YPAEVRTALE AFFKEMVATY LPRAKVEYFI