Gene Arth_4363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4363 
Symbol 
ID4443474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp102943 
End bp104406 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content61% 
IMG OID639687684 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_829381 
Protein GI116662327 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000470921 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGATC AGACCAGCGT CATCCAGGAC GTGCAACGAG GCATGATTCC TGCACACATC 
TACAACGACA AGGAGATCTT CGAACTCGAA AAGGAGCGCC TCTTCGGACG GAGCTGGCTC
TTCGTCGCCC ATGAGTCGGA GGTGCCAGAA GCCGGCGACT ACGTGGTGCG CCGCGTGCTG
GAAGACTCAT TCATCATTTC CCGCGACGAG CAGGGCGAAA TCCGCGCCCT CTTCAATATG
TGCCTGCACC GCGGAATGCA GGTCTGCCGG GCCGAAATGG GAAACGCCTC GCATTTCCGT
TGCCCCTACC ACGGCTGGTC CTACCGCAAT GACGGACGCA TTGTGGGCCT GCCGTTCCAT
AAGGAGGCCT ACGGCGGCGA AGAGGGCTTC AAGAAAAAAG GGCAGACCCT GCTTCCCGCC
CCGTCCCTGG GCGTATATAA CGGGCTGATT TTCATTAGCC TTGACCCCGA CGCGGAACCC
CTCGAGGACT TCCTGGGCGA CTTCAAGTTC TACATGGACT ACTACACCAA GCAAAGTGCT
GACGGCATTG AACTCCGCGG CCCTCAGCGG TGGCGGGTCA AGGCGAACTG GAAGATCGGT
GCCGAAAACT TCGCCGGCGA CATGTACCAC ACGCCCCAAA CGCACACGTC GGTGGTTGAA
ATTGGCCTCT TCCGCGAGCC AAAGGCGGAG AAGCGCAAGG ATGGCACAAC GTACTGGGCC
GGTAACGGCG GCGGAACCAC CTACAAGCTT CCCGAAGGCA CCCTGGAAGA CCGGCTGCGC
TACGTCGGTT ACCCGGACGA CATGATCGCG CGGATGAAGG AACAATGGAG CCAGGAGCAG
CTCGATGTCG TGGGCAAGGA CGGGTTCATG GTCTCGGCCG CCTCGGTCTT CCCAAACATG
AGCTTCGTCC ATAACTGGCC CCGTGTAGAA GAAGACTCCG ACGAAGTTCT CCCATTTATC
TCCATCCGCC AATGGCAGCC CATCAGCGAA GACGAGACCG AGATCGTTTC CTGGTTCGCC
GTGGACAAGA ACGCGTCCGA GGAATTCAAG GCGCTTTCGT ACAAGGCCTA TCTCATGTGC
TTCGGCAGCG GCGGCATGTT CGAACAGGAT GACGTTGAAA ACTGGGTCTC GCTGACGAGC
ACGGCGGGTG GCCCGATGGC CCGCCGCCTG CTGCTCAACA GCCGTATGGG CATGCTGGAA
AACGGGCAGA ACGTTGTAGA ACCGCTGACC TCCGATGAGT ATTCAGGGCC AGGTTCCACC
CGGATCGGCT ACAGCGAATA CAACCAGCGT GAACTGCTGC GGCGGTGGGC CGACCACTTG
GGACGGCCGA TGGAGAAGGC GGCTCAGCTG CACGTCGGCA CCGACCCGAT TCAGGCACCC
CCGGCCGGCG GGGCGGGCCC TTCACTGGCC CCCGCCGGAA GCACCGTTGT CCCAACTGCG
CAGATCATTT CAGAGGAGGC CTAG
 
Protein sequence
MTDQTSVIQD VQRGMIPAHI YNDKEIFELE KERLFGRSWL FVAHESEVPE AGDYVVRRVL 
EDSFIISRDE QGEIRALFNM CLHRGMQVCR AEMGNASHFR CPYHGWSYRN DGRIVGLPFH
KEAYGGEEGF KKKGQTLLPA PSLGVYNGLI FISLDPDAEP LEDFLGDFKF YMDYYTKQSA
DGIELRGPQR WRVKANWKIG AENFAGDMYH TPQTHTSVVE IGLFREPKAE KRKDGTTYWA
GNGGGTTYKL PEGTLEDRLR YVGYPDDMIA RMKEQWSQEQ LDVVGKDGFM VSAASVFPNM
SFVHNWPRVE EDSDEVLPFI SIRQWQPISE DETEIVSWFA VDKNASEEFK ALSYKAYLMC
FGSGGMFEQD DVENWVSLTS TAGGPMARRL LLNSRMGMLE NGQNVVEPLT SDEYSGPGST
RIGYSEYNQR ELLRRWADHL GRPMEKAAQL HVGTDPIQAP PAGGAGPSLA PAGSTVVPTA
QIISEEA