Gene EcolC_2947 details

Gene Information

Locus tag: EcolC_2947
Symbol:
ID: 6065619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3214192
End bp: 3215610
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 54%
IMG OID: 641602359
Product: deoxyribodipyrimidine photolyase
Protein accession: YP_001725901
Protein GI: 170020947
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
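
The coordinates and lengths listed for this gene can be cross-checked with a short calculation. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 3214192
end_bp = 3215610
gene_length_bp = 1419
protein_length_aa = 472

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp)              # 1419
print(gene_length_bp % 3)   # 0
print(expected_protein_aa)  # 472
```

Under these assumptions the span matches the reported gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the reported protein length.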


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.139043
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTACCC ATCTGGTCTG GTTTCGCCAG GATTTACGTC TGCACGATAA TCTCGCACTG 
GCTGCCGCCT GCCGCAATTC GTCTGCACGC GTGCTGGCGT TATATATCGC TACACCACGC
CAGTGGGCGA CGCATAACAT GTCGCCGCGT CAGGCTGAAC TTATCAATGC TCAACTGAAT
GGGCTACAAA TAGCGCTTGC GGAAAAAGGT ATTCCTTTAT TGTTCCGTGA AGTGGATGAC
TTTGTCGCCA GTGTCGAAAT AGTTAAACAG GTGTGCGCGG AAAACAGCGT TACCCACCTG
TTTTATAACT ATCAGTATGA AGTGAATGAG CGGGCGCGGG ATGTGGAAGT TGAAAGAGCG
CTGCGTAACG TGGTGTGTGA AGGATTTGAT GACAGCGTGA TCCTGCCGCC TGGCGCGGTG
ATGACCGGTA ATCACGAGAT GTACAAAGTC TTTACGCCTT TTAAGAATGC CTGGCTGAAA
CGGCTGCGGG AAGGGATGCC GGAGTGTGTC GCTGCGCCAA AAGTTCGTAG TAGCGGATCG
ATAGAGCCCG CGCCATCCAT TACGCTGAAT TATCCTCGTC AGTCTTTCGA TACTGCGCAT
TTCCCGGTGG AAGAAAAAGC GGCGATTGCG CAATTACGCC AGTTTTGCCA GAACGGTGCC
GGAGAATATG AGCAACAACG AGATTTTCCG GCAGTGGAAG GCACCAGTCG TTTGTCCGCC
AGCCTGGCAA CGGGCGGGTT ATCGCCTCGC CAGTGTTTGC ATCGCTTGTT GGCGGAACAG
CCGCAGGCGC TGGACGGTGG GGCCGGTAGT GTCTGGCTTA GTGAGCTGAT CTGGCGCGAG
TTCTACCGTC ATCTGATGAC GTATTACCCC TCGTTGTGTA AACATTGTCC GTTTATTGCC
TGGACGGATC GTGTGCAGTG GCAGAGCAAT CCCGCACATT TACAGGCCTG GCAGGAAGGC
AAAACGGGAT ACCCGATTGT CGATGCTGCC ATGCGTCAGC TTAACAGCAC TGGCTGGATG
CATAACCGGC TACGGATGAT TACGGCCAGT TTTCTGGTTA AAGATTTGTT GATCGACTGG
CGCGAAGGCG AGCGATATTT CATGTCGCAG CTGATTGATG GTGATTTGGC AGCCAATAAC
GGTGGCTGGC AGTGGGCCGC TTCAACCGGA ACCGATGCAG CGCCGTATTT TCGTATTTTC
AACCCGACAA CCCAGGGCGA GAAATTTGAC CGTGAGGGCG AGTTTATTCG TCGATGGTTA
CCGGAGCTGC GCGATGTACC AGGGAAAGCG GTGCATGAGC CGTGGAAGTG GGCGCAGAAA
GCAGGTGTGA TGCTGGATTA TCCGCAACCG ATAGTTGATC ACAAAGAGGC AAGGCTGCGA
ACGCTGGCAG CGTACGAAGA AGCGAGGAAA GGAGCCTGA
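
The gene sequence is printed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before any per-base statistic such as GC content is computed. The following is a minimal sketch of that step. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and only the first printed line of the sequence is used as input, so the printed value will not necessarily equal the 54% reported for the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Return the fraction of G and C bases in a DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return gc_count / len(seq)


# First printed line of the gene sequence (60 bases).
first_line = (
    "ATGATTACCC ATCTGGTCTG GTTTCGCCAG "
    "GATTTACGTC TGCACGATAA TCTCGCACTG"
)

print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line) * 100), "% GC in the first line")
```

To check the reported GC content, pass the full gene sequence text to `gc_fraction` instead of the first line.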
 
Protein sequence
MITHLVWFRQ DLRLHDNLAL AAACRNSSAR VLALYIATPR QWATHNMSPR QAELINAQLN 
GLQIALAEKG IPLLFREVDD FVASVEIVKQ VCAENSVTHL FYNYQYEVNE RARDVEVERA
LRNVVCEGFD DSVILPPGAV MTGNHEMYKV FTPFKNAWLK RLREGMPECV AAPKVRSSGS
IEPAPSITLN YPRQSFDTAH FPVEEKAAIA QLRQFCQNGA GEYEQQRDFP AVEGTSRLSA
SLATGGLSPR QCLHRLLAEQ PQALDGGAGS VWLSELIWRE FYRHLMTYYP SLCKHCPFIA
WTDRVQWQSN PAHLQAWQEG KTGYPIVDAA MRQLNSTGWM HNRLRMITAS FLVKDLLIDW
REGERYFMSQ LIDGDLAANN GGWQWAASTG TDAAPYFRIF NPTTQGEKFD REGEFIRRWL
PELRDVPGKA VHEPWKWAQK AGVMLDYPQP IVDHKEARLR TLAAYEEARK GA
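
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first printed line of the gene sequence. It assumes the amino-acid assignments of translation table 11, which to the best of my knowledge match those of the standard genetic code and differ only in the permitted start codons. The `translate` function and the table-building code are illustrative and not part of the record.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order,
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First printed line of the gene sequence (20 codons).
first_line = (
    "ATGATTACCC ATCTGGTCTG GTTTCGCCAG "
    "GATTTACGTC TGCACGATAA TCTCGCACTG"
)

print(translate(first_line))  # MITHLVWFRQDLRLHDNLAL
```

The output matches the first 20 residues of the protein sequence listed above (MITHLVWFRQ DLRLHDNLAL).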