Gene Arth_2747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2747 
Symbol 
ID4444594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3088509 
End bp3089933 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content67% 
IMG OID639690567 
Productdeoxyribodipyrimidine photo-lyase type I 
Protein accessionYP_832226 
Protein GI116671293 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00995843 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCCA CCATTGTCTG GCTCCGTGAC GACCTTCGGC TCGATGACAA TCCGGCCCTG 
GCCGATGCGG CGGCCATGGG CCACCCGCTG ACCGTCGTCT ACATTCTGGA TGAAGAATCA
CCGGGGGTGC GGCCCCTCGG CGGGGCGGCC AAATGGTGGC TCCACCATTC ATTGGTGTCC
TTGGCCGGCG GTCTGGAAGC GGCAGGCTCC CGACTGGTCC TTCGACGCGG GAGCGCTGCA
GGAATCATCC AGGAGCTGGC TGCCGAAACC GGAGCCACCC ATCTCAGGTG GAACCGCAGG
TACGGCGGAC CTGAACGCAG CATCGACGCC GGCGTCAAAG CCTGGGCAGG GGAACAGGGA
CTTGATGCAG CGAGCTTCCA GGCCAGCCTC ATGTTCGAGC CCTGGACCGT CCGCACCGGG
GCGGGCGGGC CGTACAAGGT CTTCACGCCC TTCTGGCGCG CATGCCTCGA AAGCGGAGAG
CCGCGGATCC CCTCCGACGG CCCCGGCACG TTGCCTCATC CCGCCGGGCA CGGAGACGGC
GGGCCGCCCC AAAGCGATGA CCTGGACAGC TGGGCGCTTC TCCCCCGCAC GCCCGACTGG
AGCGCAGGAC TCGCGGAACA GTGGACGCCC GGCGAAGCGG GCGCCCACAG CCGTCTGAAG
GACTTCCTGG ACGGCCCTGT CGAGGAGTAT GGAACCGGCC GCGACCGGCC GGGAGTCGAA
GGCACCAGCC GCCTCTCCCC CCATCTTCGC TTTGGTGAGA TCAGTCCCTT CCGCATCTGG
CACGCGCTCC GTGAGCGCTT CCCGCGCCAG GCTCCTGCCG ACGTCGGAAT CTTCCGCTCC
GAACTGGGCT GGCGCGAGTT TTGCTGGCAG CTTCTCTACG AGAACCCGGA GCTGGCCAGC
CGAAACTACC GTCCCGACTT TGACCGGTTC GAATGGCAGA CGCCGTCGGA CGCCGAACTG
GAAGCCTGGC AGCAGGGCCG GACAGGCTAT CCGCTGGTGG ACGCCGGGAT GCGCCAGCTG
TGGCAGACGG GTTGGATGCA CAACCGCGTC CGCATGGCCG CCGCGTCGTT CCTGGTGAAG
AACCTGCTCG CGGACTGGAG GCTGGGCGAA GCCTGGTTCT GGGACACGCT GGTGGACGCC
GATTCCGCAA GCAACCCGGC CAACTGGCAA TGGGTGGCGG GCTCCGGAGC GGACGCCTCC
CCCTATTTCC GGATCTTCAA CCCCGTGACG CAAAGCAAGA AATTCGACGC CGCCGGCCGC
TACCTGCGGG AGTTCATTCC GGAGATCGCG AACCTGAGTG AAAAAGAGAT CCACGAACCG
TGGAAGGCGC CGGAACTGGC CGCCGGTTAT CCGGAGCCGT TGGTGGGCCT GCCCGAGTCG
CGTGAGCGGG CCCTGGAGAC ATACCAGAAG CTCAAGGACA GCTAA
 
Protein sequence
MPSTIVWLRD DLRLDDNPAL ADAAAMGHPL TVVYILDEES PGVRPLGGAA KWWLHHSLVS 
LAGGLEAAGS RLVLRRGSAA GIIQELAAET GATHLRWNRR YGGPERSIDA GVKAWAGEQG
LDAASFQASL MFEPWTVRTG AGGPYKVFTP FWRACLESGE PRIPSDGPGT LPHPAGHGDG
GPPQSDDLDS WALLPRTPDW SAGLAEQWTP GEAGAHSRLK DFLDGPVEEY GTGRDRPGVE
GTSRLSPHLR FGEISPFRIW HALRERFPRQ APADVGIFRS ELGWREFCWQ LLYENPELAS
RNYRPDFDRF EWQTPSDAEL EAWQQGRTGY PLVDAGMRQL WQTGWMHNRV RMAAASFLVK
NLLADWRLGE AWFWDTLVDA DSASNPANWQ WVAGSGADAS PYFRIFNPVT QSKKFDAAGR
YLREFIPEIA NLSEKEIHEP WKAPELAAGY PEPLVGLPES RERALETYQK LKDS