Gene Arth_3714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3714 
Symbol 
ID4443715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4178751 
End bp4180055 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content61% 
IMG OID639691538 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_833189 
Protein GI116672256 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.162257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCTT CAGTGAACGT CCCCCTCAAT TCACGCGGAA AACTCGCTGC ATCCTTGCCG 
GCCGAGCAGC TGGCAGAAAT TACCGCGCTG TTTGAGTTCC GGCGCACCGG CTACTCCCTC
GATGCCCCCT TCTACACCGA TCGCTCGATC TTCAACATCG ACATGGAGGC CATTTTCGGC
CAGCACTGGA TCTTTGCCGC GAGCACCGCC GAACTGCCGG AGCCGGGCGA CTACGTCACC
GTCGACTACG GGCCCTACTC CCTGATTGTG CTGCGCAACG ACGACGGCGC CGTGAACGTC
CTGCACAACG TATGCCGCCA CCGCGGCGCC CGCGTTCTCA CCGAAGCTGC CGGGTCAACC
GGAAACCTGG TCTGCGGCTA CCACTCCTGG ACCTACTCCC CGGAGGGTAA TTTGATCCAT
GCCTCCGCGC CGGGGGAAAC GAAGTTCGAC AAGAGCTGCT TCGGCCTCAA ACGCGCCCAC
GGCCGCGAAG TCGCCGGACT CATCTTCGTC TGCATTGCCG ATGATGCGCC GACGGACTTC
GATGAAACCG CAAAAATCTT TGAGCCCTAC CTGGCGCCCC ACGATCTCTC GAAGACAAAA
ATCGCCTACC AGCAGAACAT CATCGAAGAG GGCAACTGGA AGCTCGTCAT GGAGAACAAC
CGTGAGTGCT ACCACTGCGA CGGCCACCCT GAGCTCGCCT GCTCCCTCTT CCCCACCTGG
GGCCTGACGG AGGGCCTGAT CCCGCCCCAT CTTGAGGAAG TGTGGAACCG GAACAAGGAG
GCTCAGTCCT CCCTCGAGGA GCGTTGCCGC CGCTACGGCC TTCCCTACGA GGTGGTCGAG
CAGCTTGACA CGCGTATCGC GGGAATCCGT ATCTCACGGG AATCACTCGA TGGAGAGGGT
GAGTCGTTCT CCGCGGACGG GCGGAGGCTT TCCAAGAAGC TGCTGGGTGA TTTGCCCGAC
TTCCGCCTTG GCCGCTGCTC GATGCACCTG CAGCCCAACA GCTGGTTCCA TTTCCTCGGC
GACCACGTGA TCACGTTCGG CGTCTTTCCC ATCAACGAAC ACCAGAGCCT CGTACGCACC
ACCTGGTTGG TGGCTGACGA CGCCGTGGAA GGCGTCGACT ACGACCTGGA GAAGCTCACC
TACACCTGGA AGCAAACGAA CCTGCAGGAC AAGGCGTTCG TGGAGCTGTG CCAGCAGGGT
GCCGGCAGTC CCGCCTACGA GCCCGGTCCG TACATGAAGA GCGAATACCA GGTCGAGGCA
TTCATCAACT GGTACGTGCA GCGCGTGCAG GAGCACTTGG CATGA
 
Protein sequence
MTASVNVPLN SRGKLAASLP AEQLAEITAL FEFRRTGYSL DAPFYTDRSI FNIDMEAIFG 
QHWIFAASTA ELPEPGDYVT VDYGPYSLIV LRNDDGAVNV LHNVCRHRGA RVLTEAAGST
GNLVCGYHSW TYSPEGNLIH ASAPGETKFD KSCFGLKRAH GREVAGLIFV CIADDAPTDF
DETAKIFEPY LAPHDLSKTK IAYQQNIIEE GNWKLVMENN RECYHCDGHP ELACSLFPTW
GLTEGLIPPH LEEVWNRNKE AQSSLEERCR RYGLPYEVVE QLDTRIAGIR ISRESLDGEG
ESFSADGRRL SKKLLGDLPD FRLGRCSMHL QPNSWFHFLG DHVITFGVFP INEHQSLVRT
TWLVADDAVE GVDYDLEKLT YTWKQTNLQD KAFVELCQQG AGSPAYEPGP YMKSEYQVEA
FINWYVQRVQ EHLA