Gene Tmz1t_3951 details

Gene Information

Locus tag: Tmz1t_3951
Symbol:
ID: 7873597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4347671
End bp: 4349251
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 75%
IMG OID: 643700888
Product: Deoxyribodipyrimidine photo-lyase
Protein accession: YP_002890911
Protein GI: 237654597
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
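
The coordinate and length fields in this record are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4347671, 4349251)
print(length, protein_length(length))  # prints: 1581 526
```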


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCTACG GCCTGGTCTG GTTCAAGCGC GACCTGCGCC TCGCCGATCA CGCCGCCTTG 
GCCACGGCCG CGCGGCGCGG GCCGGTGCTG TGCGTGCTGA TCGTCGAGCC GTCGCTGTGG
GCGCAGCCCG ACGCCGCGCG CCAGCACTAC GAGTTCATGC TCGAAAGCGC ACGCGAGCTG
CACGCCGGGC TGGCCCGCGT CGGCGGTCGC CTGCACCTGC TGGTGGGCGA GGCCGTCGCG
GTGCTCGATC GCCTGCACGC CGCAGCGCCC TTCGACACCC TGCATTCGCA CGAGGAGACC
GGCAACGCCG CGAGCTACGC GCGCGACCGC GCGGTGGCGC GCTGGTGCCG GGCGCGCGGC
GTGCGCTGGC ACGAGCCGGC GCAGTTCGGC GTGGTGCGCC GGCTCGACGA CCGCGACCGC
TGGCAGGCAG CGTGGGAGGC GCAGGTCGCC GCGCCGCAGG TCGAGCTGCC CGAGCCCTCG
CGGCTGCGCT TCGTTGCGTT GCCCGCTGCC CTGCAGCCGG GCGCCGGCGC GACCTGGCCG
GATCGCGGCG CGATCGCCGC CGTGCGTGCG CCGGCGGCCG TGGCCCTCGG GCTCGACGCC
TTCGAGCCGC CCCGGCGCCA GCGTGGCGGG CGGCACGCGG CGCTGGAGGT GCTGCACGAC
TTCCTCGACG CGCGCAGCGG GCAATACCGC GGCGGCATCT CCTCGCCGCT GAAGGCACCC
ACCGCGTGCT CGCGGCTGTC GCCCTACCTG GCCTGGGGCT GCCTGAGCCT GCGCGAACTG
GTGCAGGCCA CCCGCGCGCG CGTCGCCGCG CTGCCCGAGG GCGACCGCCG CCGCGCCGGC
CTGGCGGCCT TCCTCAGCCG CCTGTACTGG CACTGCCACT TCATCCAGAA GCTGGAGAGC
GAGCCGACGC TGGAGTTCCG CAACCTGCAC CGCGGCTACG ACGGCCTGCG CGAGCCGGAA
TGGAACCAGG CGCATTTCGA CGCGCTGGTG GGCGGGCGCA CCGGCTGGCC ACTGGTCGAC
GCCTGCGTGG CGATGCTGCG CGCGACCGGC TGGCTCAACT TCCGCATGCG CGCGATGCTG
GTGTCGGTGG CGGCCTACCC GCTCTGGCTG CACTGGCGCG AGGTCGGCCT GTGGCTGGCG
CGCGCCTTCC TCGACTACGA GCCCGGCATC CACTGGAGCC AGCTGCAGAT GCAGTCCGGC
ACCACCGGCA TCAACACCAC CCGGGTGTAC AACCCGATCA AGCAGGCGCG CGACCACGAC
CCGCAGGGCG TGTTCGTGCG GCGCTGGCTG CCGGCACTGC GGCGGGTGCC GGACACCTGG
CTGTTCGAGC CCTGGCGCAT GCCGGAGTCG GTACAGGCGC GCTGCGGCGT GCGCGTCGGC
GAGGACATCG CGTTGCCGGT GGTCGATCTG GAGAGCGCCA CGCGCGCCGC CAAGACGCGC
ATCCACGCAC TGCGCGCCCA GCCCGAGGTG CGCGCGGCGA AGGCGGCCAT CGTCGAGCGC
CACGGCTCGC GCAAGCCGCC GCAGGGGCGG CGCAAGACGG CGGCGGGGTC GGCGTCGGGA
CAGCTGGACC TGGGGTTTTG A
 
Protein sequence
MSYGLVWFKR DLRLADHAAL ATAARRGPVL CVLIVEPSLW AQPDAARQHY EFMLESAREL 
HAGLARVGGR LHLLVGEAVA VLDRLHAAAP FDTLHSHEET GNAASYARDR AVARWCRARG
VRWHEPAQFG VVRRLDDRDR WQAAWEAQVA APQVELPEPS RLRFVALPAA LQPGAGATWP
DRGAIAAVRA PAAVALGLDA FEPPRRQRGG RHAALEVLHD FLDARSGQYR GGISSPLKAP
TACSRLSPYL AWGCLSLREL VQATRARVAA LPEGDRRRAG LAAFLSRLYW HCHFIQKLES
EPTLEFRNLH RGYDGLREPE WNQAHFDALV GGRTGWPLVD ACVAMLRATG WLNFRMRAML
VSVAAYPLWL HWREVGLWLA RAFLDYEPGI HWSQLQMQSG TTGINTTRVY NPIKQARDHD
PQGVFVRRWL PALRRVPDTW LFEPWRMPES VQARCGVRVG EDIALPVVDL ESATRAAKTR
IHALRAQPEV RAAKAAIVER HGSRKPPQGR RKTAAGSASG QLDLGF
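
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the 10-base blocked layout shown above, unambiguous A/C/G/T bases, and the codon assignments of translation table 11 (which match the standard code). As a further assumption, a leading ATG, GTG, or TTG is read as the initiator methionine. All helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCTACG GCCTGGTCTG GTTCAAGCGC"))  # prints: MSYGLVWFKR
```

Passing the full gene sequence to translate_cds and gc_fraction allows a comparison against the protein sequence and the GC content listed in the record.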