Gene Arth_2052 details


Gene Information

Locus tag: Arth_2052
Symbol:
ID: 4445426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2313055
End bp: 2314188
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 67%
IMG OID: 639689860
Product: cupin 2 domain-containing protein
Protein accession: YP_831532
Protein GI: 116670599
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3435] Gentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR02272] gentisate 1,2-dioxygenase
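
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2313055
end_bp = 2314188
gene_length_bp = 1134
protein_length_aa = 377

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codons = span_bp // 3

# Assumption: the last codon is a stop codon and encodes no residue.
coding_codons = codons - 1

print(span_bp, codons, coding_codons)
```

Under these assumptions the span equals the listed gene length, and the number of coding codons equals the listed protein length.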


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.748893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCATCA GCGCCGAGAA CACGACTCAT GAATCAGTGG CCGCGGGGCA CACTGCTCCG 
GAGCCGACGC CTGAAGAAGC TGCCCAGCTC GAGGAGCTGT ACCGGGATTT TGACCGGGAG
AACCTGATCC CGTTGTGGAC TGAGATCGGT GACCTGATGC CGATGGTCCC CTCCCCGAAG
GCGGTGCCGC ATGTGTGGCG GTGGAGCGAC CTGTACCCGC TGGCCGCCCG CGCAGGTGAC
CTGGTGCCGG TGGGCCGCGG CGGGGAACGC CGCGCCATTG CCCTCGCCAA CCCTGGTTTG
GGCGGCACGG CCTATGCCAC GCCCACTCTG TGGGCAGCCA TCCAGTACCT GGGCGGCCAC
GAAACAGCCC CCGAGCACCG CCATTCCCAA AACGCGTTCC GCTTCGTCGT CGAAGGTGAA
GGCGTGTGGA CCGTGGTGAA CGGGGACCCG GTCCGGATGT CCCGCGGTGA TTTCCTGCTG
ACCCCGGGCT GGAACTTCCA CGGCCACCAC AACGACACCG ATGAGCCGAT GGCCTGGATC
GACGGCCTGG ACATCCCGTT CGTGCACTAC GCGGACGCCG GGTTCTTCGA GTTCGGCACC
GAACGGGTCA CCGACGAGGC CACCCCGGAC ATCTCCCGCT CCGAGCGGCT CTGGGCCCAC
CCGGGCCTGC GCCCGCTCTC CGGCCTGGAT GACACCACCA GCTCCCCCAT CGCCGCGTAC
CGGTGGGAAT ACACTGACCG TGCCCTGGCC GAGCAACTTT TGCTCGAGGA CGAGGGCCAC
CCGGCCACCG TGTCCCAGGG CCACGCCGCT GTCCGTTACA CCAATCCCAC CACCGGCGGG
GACGTGATGC CCACCATCCG GGCCGAATTC CACCGCCTCC GGCCCGGCGC GTCCACCCAG
GGCGTCCGCG AGGTCGGCTC CAGCGTCTGG CAGGTCTTCG AAGGGACCGG TGCCGTTGTT
CTCAACGGCG AACCCCGGAC CCTGGAAAAG GGCGACCTCT TCGTTGTCCC GTCCTGGGCT
GAATGGTCCC TGCAGGCTGA GAGCGGGTTT GATCTGTTCC GGTTCAGCGA CGCCCCCATT
TTTGAACGAC TGAACTTCAA CCGCACCTAC ATCGAAGGAC GCAAGAACGC ATGA
 
Protein sequence
MSISAENTTH ESVAAGHTAP EPTPEEAAQL EELYRDFDRE NLIPLWTEIG DLMPMVPSPK 
AVPHVWRWSD LYPLAARAGD LVPVGRGGER RAIALANPGL GGTAYATPTL WAAIQYLGGH
ETAPEHRHSQ NAFRFVVEGE GVWTVVNGDP VRMSRGDFLL TPGWNFHGHH NDTDEPMAWI
DGLDIPFVHY ADAGFFEFGT ERVTDEATPD ISRSERLWAH PGLRPLSGLD DTTSSPIAAY
RWEYTDRALA EQLLLEDEGH PATVSQGHAA VRYTNPTTGG DVMPTIRAEF HRLRPGASTQ
GVREVGSSVW QVFEGTGAVV LNGEPRTLEK GDLFVVPSWA EWSLQAESGF DLFRFSDAPI
FERLNFNRTY IEGRKNA