Gene Arth_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0231 
Symbol 
ID4447322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp242698 
End bp244047 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content70% 
IMG OID639688027 
ProductTat-translocated enzyme 
Protein accessionYP_829732 
Protein GI116668799 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGGGACA CCTCCCAGGC CCCGGCCTCA GCTGACAGCG CCCCCAACGG TGCCGCGGAC 
GCCAAAGCCG CCGCCGCCCG CGGCGTTTCT CGACGGGGCC TGCTTTCTTT CGCCGGGGTC
GGTGGCGCCG GGGCGCTTGC CGGAATTGCC GCCGGGCTGT GGGGCCGCGA CGCCGTCTTC
GCGGCAGAGC CGGCCGTGGA ACCGGCTGAT GACACCGTCC CGTTCCACGG GGAGCGCCAG
GCCGGAATTA CGACGGCGGC GCAGGACCGG CTGCACATGG CGGCCTTCGA TGTCCTCACC
GAGGACCGCG ACGAGCTGAT CCGGCTGCTC AAGGACTGGA CGGCCGCCTC GGAGGCCATG
ACGCAAGGCC GCGAAACCGG CGAAACCGGC GCGGCCGGCG GCTCCTATGA CGCCCCGCCG
CAGGACACCG GCGAGGCCTT GGGACTCAGC GCCGGCAAGC TCACCGTGAC CTTCGGCTTT
GGTGCCAGCC TGTTCGAAAA GGACGGAAAG GTGCGGTTCG GGCTCGAGGG CAGGCGCCCT
GATGCCCTCA TCGACCTGCC GCATTTCCCG GGCGATGACC TGCAGGCGGG ACGCAGCGGC
GGGGACATCA TCGTGCAGGC CTGCGCTGAC GATCCCCAGG TGGCCGTCCA CGCCGTCCGC
AACCTGGCCC GGCTGGGGTT CGGCAAGGTC CGCGTCCGCT GGTCCCAGCT GGGCTTCGGG
CGCACGGCCT CCACGTCCCG CGCACAGCAG ACGCCCCGCA ACCTGTTCGG TTTCAAGGAC
GGCACCAACA ACCTCAAGGT CGAAGACACG GAGCTGCTGG AGAACCACGT CTGGGCCGGG
GCAGGCACCC GGCCCGGAGA AGCCTGGATG GAGGGCGGAA GCTACCTTGT GGCCCGCCGC
ATCCGCATGC ACATCGAGAT CTGGGACCGG ACGTCCCTGG GCGAGCAGGA AGCCCTGATC
GGACGGACCA AGGCCGAGGG CGCCCCGCTG TCCGGCGGCA AGGAATTCAC CGCCCCTGAT
TTCACCATCA AGGGCAAGGA CGGCAAGCCC CTGATGGGCT TGGACTCACA TGTCCGGCTG
GCCCATGCCG ACCAGAACGG CGGGGTCCGG ATGCTGCGCC GCGGGTACAA CTACACGGAC
GGATCCGACG GGCTTGGGCA CCTCGACGCC GGGCTGTTCT TCATCGCCTT CGTCAAGGAC
CCGCGCACGC ACTATGTGCC CATGCAGATG GCGATGGCCA AGCAGGACAC CCTGGCCGTG
GAGTACCTCA AGCACACCGG CTCCGCCCTG GCCGCGGTGC CGCCGGGCAC GAGGCCCGGC
GGCTTCCTCG GAGAAGGCCT CTTCAGCTGA
 
Protein sequence
MGDTSQAPAS ADSAPNGAAD AKAAAARGVS RRGLLSFAGV GGAGALAGIA AGLWGRDAVF 
AAEPAVEPAD DTVPFHGERQ AGITTAAQDR LHMAAFDVLT EDRDELIRLL KDWTAASEAM
TQGRETGETG AAGGSYDAPP QDTGEALGLS AGKLTVTFGF GASLFEKDGK VRFGLEGRRP
DALIDLPHFP GDDLQAGRSG GDIIVQACAD DPQVAVHAVR NLARLGFGKV RVRWSQLGFG
RTASTSRAQQ TPRNLFGFKD GTNNLKVEDT ELLENHVWAG AGTRPGEAWM EGGSYLVARR
IRMHIEIWDR TSLGEQEALI GRTKAEGAPL SGGKEFTAPD FTIKGKDGKP LMGLDSHVRL
AHADQNGGVR MLRRGYNYTD GSDGLGHLDA GLFFIAFVKD PRTHYVPMQM AMAKQDTLAV
EYLKHTGSAL AAVPPGTRPG GFLGEGLFS