Gene Arth_4060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4060 
Symbol 
ID4447791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4582718 
End bp4583830 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content65% 
IMG OID639691891 
Productoxidoreductase domain-containing protein 
Protein accessionYP_833535 
Protein GI116672602 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCGT CACAGGAGTC CTACCGCCGG GACCTTGACC TTATGGCATC CGCGGCACCG 
CTGCGGGCCG CCGTGATCGG TGCCGGCTAC TGGGGGCCAA ACCTTGCCCG GAATTTCAAG
GCCAGCCCGG ACTGGCAACT TGCAGCGATC GTGGACATGG ACCGCGACCG GGCCGCCAAG
CTTGCGGCAG CCCACGGCGG CGTGCCGGTC TGCGAATCAA TTGACGAACT GCTGGACACC
GTTGACGTCG ACGCGGTGGC CATTGCTACT CCGGCGCACA CCCACCACGG GATCGCCCTG
ACAGCGTTGC GCGCGGGAAA GCACGTGCTT GTGGAAAAGC CCCTGGCCGA CAGCAGGGCC
AAAGGCGTGG AGATGGTCGA AGAAGCAGAG AACCGCGGGC TGGTCCTGAT GGCCGACCAT
ACGTACTGTT ACACCCCCGC CGTCCTGAAG ATCCGCGAAC TGATCGCGGA GGGCTCGTTG
GGCGAGATCT TGTTCATCGA CTCGGTGCGC ATCAACCTCG GGCTTGTGCA GCCTGACGTT
GACGTGTTCT GGGACCTGGC TCCGCACGAT CTGGCCATCA TCGATTTCAT CCTGCCCGGC
GGCCTCCGTC CCGCTGAAGT GGCCGCCCAT GGAGCGGATC CGCTGGGAAC CGGACGGGAC
TGCGTGGGGC ACCTGACGTT TGCGCTGCCG AACGATGCCA TGGCGCATGT GCACGTGAAT
TGGCTCAGCC CTACCAAGAT CCGCCAGATG GTGGTGGGTG GTTCCCAGCG GACCCTCGTC
TGGGATGACC TGAATCCGCA GCAACGGCTG AGTGTGTACG ACCGCGGCGT CAGCCTGGAC
CGGAAATACC GTTCGCCCGC GGAGAAGAAG GCATTCGCCA TTTCCTACCG GCTGGGTGAC
ACATGGGCGC CTGCGCTGCC GGAACACGAG GCCCTCGGCC AGATGGTGGC GGAATTCGCC
AGCAGCATCT GGCATCACCG GCCTGCGCGG ACCAGCGGTA CCTCCGGGTT GCGGGTGCTC
TCCGTTCTGG AAGCGGTCAG CCGCAGCCTC AGCGGTGATG GGGCCTCGGT CGCCGTCACG
GGCAACGAAA CCCAGTTGGA GGGACGGCGA TGA
 
Protein sequence
MESSQESYRR DLDLMASAAP LRAAVIGAGY WGPNLARNFK ASPDWQLAAI VDMDRDRAAK 
LAAAHGGVPV CESIDELLDT VDVDAVAIAT PAHTHHGIAL TALRAGKHVL VEKPLADSRA
KGVEMVEEAE NRGLVLMADH TYCYTPAVLK IRELIAEGSL GEILFIDSVR INLGLVQPDV
DVFWDLAPHD LAIIDFILPG GLRPAEVAAH GADPLGTGRD CVGHLTFALP NDAMAHVHVN
WLSPTKIRQM VVGGSQRTLV WDDLNPQQRL SVYDRGVSLD RKYRSPAEKK AFAISYRLGD
TWAPALPEHE ALGQMVAEFA SSIWHHRPAR TSGTSGLRVL SVLEAVSRSL SGDGASVAVT
GNETQLEGRR