Gene TM1040_1441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1441 
Symbol 
ID4078071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1538768 
End bp1540192 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content62% 
IMG OID638006752 
Productdeoxyribodipyrimidine photo-lyase type I 
Protein accessionYP_613436 
Protein GI99081282 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.511514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.115222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACA CCCGACCGGT GATCTGGTGG ATTCGGCGCG ACCTGCGCCT GCGCGACAAT 
CCAGCCCTCA GGGCCGCGCA GGAGGCGGGC GGACCGATTA TCCCGCTCTA TATCCATGAC
GCGCAAGAAG AGGCCCTCGG GGCCGCGCCA AAGTTCCGCC TTGGCCTCGG GCTTGAACGG
TTTGCCAAAA CGCTCGAGGA GAAAGGCAGC CGTCTCATCG TCCGCCGTGG CGACGCACTC
GAGGTGCTGC GCGAGGTCAT CGCGGAGAGC GGCGCCGGAC ATGTGATCTG GTCGCGGCTC
TACGATCCGG CGGCCACAAA ACGTGACGCA AAGATCAAGG AAGCACTCAA AGCGTCTGAT
ATAAAGGCAA AATCCACGGG TGGGCGGCTG CTATTCGAAC CTTGGACCGT CGACACCAAG
GACGGCGGCA TGTACCGGGT ATATACGCCG TTCTGGAAGG CAGTCCGTAC GCGCGATGTC
GCGGAGCTGA CTGCCGCCCC CTCCCGGTTG GCCGCGCCCG AGAGTTGGCC GTCAGGTGAA
GCGCTCGCGG ATCTTGGCAT GGATGCCGCA ATGCGTCGGG GTGGCGCGAT TGTTGCGCAA
CATTGCCGTG TGGGAGAGGA CGCAGCCCTC GCGCAACTGG ATGATTTCCT GACGGAACGG
GTTTCGAACT ACAAAGCCCA TCGGGATTTT CCCGCCCGTG CTGCCACCTC GGAGCTTTCG
GAAAATCTCG CCTGGGGGGA AATCAGCCCA CATCGGATGT GGCATCTGGG CGCGCAGGCG
ATGCAGGACG GCCAGCCCGG CGCGGAGCAT TTCCTCAAAG AGGTTGTCTG GCGCGAGTTT
GCCTATCACC TGATGCATCG CTCGCCACGC ATCCTGACGC GAAACTGGCG AGAGGAATGG
GACGGTTTCC CCTGGCAGGA CGACCCTGGC GACGCTTTGC AGCGCTGGCA ACAGGGACAG
ACCGGCTATG ATTTTGTCGA TGCCGCGATG CGAGAGCTCT ACGTCACCGG CAAGATGCAC
AACCGCGCTC GGATGATCGT GGCAAGCTTT CTTACCAAGC ATCTGATGAT CCATTGGAAA
TACGGATTGC AGTGGTTCGA GGACTGTCTT GTCGACTGGG ATCCCGCTTC GAACGCGATG
GGCTGGCAGT GGGTGGCGGG TTCCGGTCCT GATGCGGCGC CCTTTTTTCG CATTTTCAAC
CCGGATGGTC AGCTCGAGAA GTTCGACCCC AAGGGAGACT ATGCGCGTCG GTGGCTGCCC
GAAGGCCAGG CCAACCCGCC CGAAACCGCG ATGGCATACT TTGAGGCGAT CCCGCGCAGG
TGGTCGCTTC ATCCCGACCA GTCGCGTGTT TCACCGATTA TCGGTTTGAA AGAGGGCCGC
GAGCGCGCGC TTTCTGCCTA CAAAGAAAGC CGCAATCAGA GCTGA
 
Protein sequence
MADTRPVIWW IRRDLRLRDN PALRAAQEAG GPIIPLYIHD AQEEALGAAP KFRLGLGLER 
FAKTLEEKGS RLIVRRGDAL EVLREVIAES GAGHVIWSRL YDPAATKRDA KIKEALKASD
IKAKSTGGRL LFEPWTVDTK DGGMYRVYTP FWKAVRTRDV AELTAAPSRL AAPESWPSGE
ALADLGMDAA MRRGGAIVAQ HCRVGEDAAL AQLDDFLTER VSNYKAHRDF PARAATSELS
ENLAWGEISP HRMWHLGAQA MQDGQPGAEH FLKEVVWREF AYHLMHRSPR ILTRNWREEW
DGFPWQDDPG DALQRWQQGQ TGYDFVDAAM RELYVTGKMH NRARMIVASF LTKHLMIHWK
YGLQWFEDCL VDWDPASNAM GWQWVAGSGP DAAPFFRIFN PDGQLEKFDP KGDYARRWLP
EGQANPPETA MAYFEAIPRR WSLHPDQSRV SPIIGLKEGR ERALSAYKES RNQS