Gene Arth_4346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4346 
Symbol 
ID4443457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp85637 
End bp87154 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content63% 
IMG OID639687667 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_829364 
Protein GI116662310 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.23778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACAA ACATCTGTTC CAGTTCAGAA ACTTGTCCGA CGTCAGAAAC GTGTCCCACG 
CCGGCATCTC AGACCGAGGA CCCGCGCCGC GCCCGTGATC TGGTCGACTT GGAGTCCGGG
CAGATCAGCC GGGAGGTCTT CTTCAACCAG GAGATCTTCG ATCTCGAAAT GCAGAATCTG
TTCCCCCGTG CCTGGCTTTT CGTCGGCCAT GCCTCGCAGA TCCCGAACCC GGGCGACTAC
TTCTCGTCGT GGATGGGCAG CGACCCGGTC CTGCTCACAC GCGACGTGGA CGGTGGGATC
TACGTGCTGC TGAACTCCTG CCGCCACCGC GGCATGCGCG TGTGCCGTTA TGACGAGGGC
AACACGATGC AGTTCACCTG CCCCTATCAC GCATGGTCGT ATTCGATGGA CGGCAGCCTG
GTTAACGTCC CTGGGGACCT CTTTGGCGTC CCGCATATGA AGGCCGCCTA CAGCGGCAAG
CTCGATAAGC AGAACTGGGG CCTTGTGCGC TGCCCTAAGG TCTACAACTA CAAGGGCCTC
GTCTTCGCCA ACTGGGATGA AAACGCGGAG GACTTCCTTG ACTACGCCGG CGACTTCCAC
TGGTGGCTGG ACAACCTTGC CGATGCCTTC GATGGCACCC CCGGCGACAC TGAGGTCTTC
CACGGTGTAC TCAAGTGGCG CATCAAGTCG AACTGGAAGT TCGTCTCTGA AAACTTTCTC
GGCGACACCT ACCACGGCGC ATCGACGCAC GCCTCGGTCG AGGCCATCGG CATCGGCCCT
GGCGGCCGCG GCAAGCGCCG CCACGGCGAA CGTCAGGACG AGGGCGGCCA CTCGACGGGC
CGTATGAAGA CCTCGTTCCG CAACGGTCAT GGCGCCAGCG ACAACCTCGC CTATGAAATC
GCCTATCCCC AGTTCGTTGA GCCGGAGATG AACGAGTACT TTGACCAGGC CTGGGCGACC
CGCAAGGAGC GCCTGGACGC GGAGGGCCGC CTCCTCGGCG GTCGCGGGCC GGCGACGATG
TTCCCGAACA TGTCGTTCGC GGCCGGCTTC CCGCGGAGCA TCCTCGTCGC GCATCCGATC
AGCCCCACCG AGACCGAGGT GTGGCGCTGG TTCCTCAGCG ATAAAAAGGC CCCCGAGCAC
GTGCGTGAGT GGCTGCGCCA GTACTACATG CGCTATGGCG GCCCTGCGGG CATGACCGAG
CAGGACGACA TGGAGAACTG GGACTACGCC ACCCAGGCGT CCAAGGGCGT CGTTGCCCAG
CGGTACCCGT ACAACTACCA GCAGGGTCTC GGCACCGAGC AGCTCTCTGA GCTCGATCGT
GCGGTGCATT CCAACCACGC GATCTCCGGC GAGGTGAACG CCCGGGCCTT CTACCGCCGC
TGGTCGGAGT TCGTCGACAA CCTCAGCTGG GACGAGCTCC TCGAGATCGC GAAGTCGGAC
GATCGGATTG ACGCGATTAT TCGCCGTCAA GAGGAAGACG CGGTTGCGGA AGCCGCCGCA
GGGAAGGTAA GCCACTGA
 
Protein sequence
MTTNICSSSE TCPTSETCPT PASQTEDPRR ARDLVDLESG QISREVFFNQ EIFDLEMQNL 
FPRAWLFVGH ASQIPNPGDY FSSWMGSDPV LLTRDVDGGI YVLLNSCRHR GMRVCRYDEG
NTMQFTCPYH AWSYSMDGSL VNVPGDLFGV PHMKAAYSGK LDKQNWGLVR CPKVYNYKGL
VFANWDENAE DFLDYAGDFH WWLDNLADAF DGTPGDTEVF HGVLKWRIKS NWKFVSENFL
GDTYHGASTH ASVEAIGIGP GGRGKRRHGE RQDEGGHSTG RMKTSFRNGH GASDNLAYEI
AYPQFVEPEM NEYFDQAWAT RKERLDAEGR LLGGRGPATM FPNMSFAAGF PRSILVAHPI
SPTETEVWRW FLSDKKAPEH VREWLRQYYM RYGGPAGMTE QDDMENWDYA TQASKGVVAQ
RYPYNYQQGL GTEQLSELDR AVHSNHAISG EVNARAFYRR WSEFVDNLSW DELLEIAKSD
DRIDAIIRRQ EEDAVAEAAA GKVSH