Gene Arth_3520 details


Gene Information

Locus tag: Arth_3520
Symbol:
ID: 4443830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3958367
End bp: 3959836
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 68%
IMG OID: 639691344
Product: hypothetical protein
Protein accession: YP_832995
Protein GI: 116672062
COG category: [H] Coenzyme transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
        [COG0684] Demethylmenaquinone methyltransferase
TIGRFAM ID:
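
The start, end, and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3958367, 3959836)
    print(length)                  # 1470
    print(protein_length(length))  # 489
```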


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAACAGG TCACCGACCA CCTCCTGGCC GCGGCCCGCA AAGTCATCGC GGTGCATATC 
AACTACCCCA GCCGGGCAGC CCAGCGCGGC CGCACTCCGG AGCAGCCCTC GTACTTCCTG
AAGCCGTCCT CCTCCCTGGC GATCGGGTCG GCAGAAGCCC CTTCAACGGT GGAGCGCCCG
GCCGGATGTG AGCTCCTCGG TTACGAAGGC GAGATCGCCC TGATCATCGG CAAGCCGGCC
CGCCGCGTGG GCATCGAGGA CGCCTGGAGC CATGTCGAAT GGGTCACCGC CAGCAACGAC
CTAGGCGTCT ACGACCTCCG CTACGCGGAC AAGGGCTCCA ACCTCCGTTC CAAGGGCGGG
GACGGTTTCA CGCCGATCGG CCCGGGGCTG ATCGCCGCCG ACGCCGTGAA CCCCGCACAA
CTGCGGATCC GCACCTGGCA TAACGGCGAA CTGGTCCAGG ACGACACCAC CGAAGACCTC
CTCTTTCCGT TCGCCCGGCT CATCGCGGAC CTGTCCCAGC TGCTCACCCT CGAAGAGGGC
GACATCATCC TCACCGGCAC CCCGGCCGGC GCTTCCGTCG CCAAGCCGGG CGACGTCATC
GAGGTTGAAG TCAGCACTCC TGACGCGACC ACCGGGCGGC TGGCCACCCG GGTGGAGGAA
GGCACGACGC CGTTCGCGGA CTTCGGCGCC CGCCCCAAGA CCGATGACCT CCAGCGGGAG
GAAGCATACG GTTCGCGGGA AGCGGCCGGG CTTGCCGCCG TCGGACCTGT CCTCTCGCCG
GAGCTGAAGG CCAAGCTGGA AAGCGTCTGC ACGGCCACGC TGTCCTCCCA GCTGCGCAAG
CGCGGCCTGA ACAACGTCAG CATCGACGGC CTCACCTCAA CGCGTCCGGA GAAGCGGATC
GTGGGCCTGG CCCGGACCCT GCGCTACGTG CCGAACCGCG AGGACCTCTT CAAGACCCAC
GGCGGCGGCT TCAACGCCCA GAAGAAGGCC ATCGACTCGG TCAACGAGGG CGAAATCCTG
GTGATGGAAG CCCGCGGCGA AAAGGGCACC GGCACCATCG GCGACATCCT GGCCCTCCGC
GCCCAGGTCC GCGGCGCCGC CGCCGTCATC ACCGATGGCG GCGTCCGTGA CTTCTCCGCT
GTGGCCGCCA TGGACATGCC CACGTACTAC TCCAACCCGC ACCCCGCGGT GCTGGGGCGC
CGGCACATCC CGTGGGACAC CGACATCACG ATCGCCTGCG GCGGCACCAC CGTACAGCCC
GGGGACATCA TCGTGGCCGA TGCGGACGGC ATCCTGGTGA TCCCGCCGGC CCTCGCCGAG
GAGCTTGCGG ACGATTCCAT CGCCCAGGAA CGCGAGGAGG CGTTCATCGC CGAGATGGTG
CAGCAGGGCC ACAGCGTGGA CGGCCTCTAC CCGTTGAACT CCGAATGGCG GGCCAAGTAC
GACGAATGGG AAGGCCCCGC ACATGACTGA
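
The sequence above is laid out in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed; the helper names are hypothetical, and the example uses only the first two blocks of the record rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence in this record.
    seq = clean_sequence("TTGGAACAGG TCACCGACCA")
    print(len(seq))          # 20
    print(gc_fraction(seq))  # 0.55
```

Applying the same two functions to the full block of sequence lines should give a value comparable to the GC content field reported in the Gene Information section.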
 
Protein sequence
MEQVTDHLLA AARKVIAVHI NYPSRAAQRG RTPEQPSYFL KPSSSLAIGS AEAPSTVERP 
AGCELLGYEG EIALIIGKPA RRVGIEDAWS HVEWVTASND LGVYDLRYAD KGSNLRSKGG
DGFTPIGPGL IAADAVNPAQ LRIRTWHNGE LVQDDTTEDL LFPFARLIAD LSQLLTLEEG
DIILTGTPAG ASVAKPGDVI EVEVSTPDAT TGRLATRVEE GTTPFADFGA RPKTDDLQRE
EAYGSREAAG LAAVGPVLSP ELKAKLESVC TATLSSQLRK RGLNNVSIDG LTSTRPEKRI
VGLARTLRYV PNREDLFKTH GGGFNAQKKA IDSVNEGEIL VMEARGEKGT GTIGDILALR
AQVRGAAAVI TDGGVRDFSA VAAMDMPTYY SNPHPAVLGR RHIPWDTDIT IACGGTTVQP
GDIIVADADG ILVIPPALAE ELADDSIAQE REEAFIAEMV QQGHSVDGLY PLNSEWRAKY
DEWEGPAHD
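
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates this under stated assumptions: the `translate` helper is hypothetical, it treats the first codon as the initiator, and it stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 uses the standard assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a coding sequence, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the gene sequence in this record.
    print(translate("TTGGAACAGG TCACCGACCA C"))  # MEQVTDH
```

The last 30 bases of the gene (GACGAATGGG AAGGCCCCGC ACATGACTGA) translate the same way to DEWEGPAHD, with the final TGA acting as the stop codon.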