Gene Arth_2047 details

Gene Information

Locus tag: Arth_2047
Symbol:
ID: 4445421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2308302
End bp: 2309633
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 66%
IMG OID: 639689855
Product: erythromycin esterase
Protein accession: YP_831527
Protein GI: 116670594
COG category: [R] General function prediction only
COG ID: [COG2312] Erythromycin esterase homolog
TIGRFAM ID:
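
The Gene Length and Protein Length fields can be cross-checked against the Start bp and End bp coordinates. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length(2308302, 2309633)
print(length, protein_length(length))  # 1332 443
```

Both values agree with the fields listed in the record.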


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAACG GAAACCGTGC TGTCCTGAGC ATGCTGGACG AGATCCGCAC CCTGGCCCGG 
CCCCTCACCG GGATCCGGGA CCTTGACCGG CTGGTCCACG GCGCCGGTAC CGGACGCTTC
GTGGCCATCG GCGAAGCATC CCACGGCACG CACGAGTACT ACACCATGCG TGCCCGCCTG
AGCATGCGGC TGATCGAAGA GCAGGGTTAC AGCTGGATCG GGGTGGAAGG CGACTGGCCG
GACTGCTGGC GGATCAACCG CTGGGTCCGG GGACAAAGCG GCCAGGACAC TGGAGTGCAC
ACCATGCTCG CCGGATTCGG GCGCTGGCCC ACCTGGATGT GGGCAAACGA GGAAGTGGCA
GGCTTCCTTG ACTGGCTCCG CGGCTGGAAC CTGGAGCGTC CGATGGAAGA GCGGGTTGGC
TTCTACGGGC TGGACGTCTA CTCACTGTGG GATTCCCTGC GGGAGATCAT CGGCTGGCTC
GAGGAAAACG AGCCCGACGC CGTCCCCGCT GCCATGCGGG CGTGGCGGTG TTTCCTTCCG
CATCACGAAG ACCCGCACGA GTACGCCTGG AGCACCCGGC TGGTCCCCGA ATCGTGCGAG
GCTGATGTGG TTGCACTCCT CACCGAGGTG CGGAACCGGG CATTTGCGCT GCGCGACCAC
GAACCGCGCG TCGAAAGCGA TGAGGCTTTC GACGCGGTCC AGAACGCTGT GGTTGCCGCC
AACGCCGAGC ATTACTACCG CATCATGGTG CAGGGGAGCC GCGAGTCCTG GAACGTCCGG
GACCTTCACA TGGCGGACAC CGTGGACCGG TTGAGTGCGC ATCTGGGCCC GGCGTCCAAA
GGGATCATCT GGGAGCACAA CACCCATGTG GGCGATGCGC GGGCCACGGA CATGGCGCGG
GACGGGCTGG TGAACGTTGG CCAGCTGCTC CGCGAGCGCC ACGGTTCCGA AGGCGTCACC
CTGGTGGGTT TCGGATCGTA CCGGGGAACA GTCATGGCTG CGGACGCCTG GGGCTCCCCG
GAACGGGTGC TCACGGTGCC GGAAGCCCGG ACCGGCAGCC ACGAGGATCT GCTGCACCGG
GCCCTGGGCG CACCGGCATT GTTGGAATTC GGCGGGGACA GGTCCGGGCC GTGGCTGTCA
ACCTGGTTGG GTCACCGGGC CATCGGTGTT GTCTACCGCC CGGCCAGGGA ATCCGGAAAC
TACGTTCCCA CCCGGATGGG AGGGCGTTAC GACGCCCTCA TCTGGATGGA ACAGACCTCG
GCGCTTCGTC CGCTGCATCA CGAGGCTCCG CCGAGTGAGC CGGAGTTCGA AACGGAGCCC
ACCGGATTCT GA
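
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. As a hedged sketch, the GC content field could be recomputed along these lines; the helper names are hypothetical, and only the first line of the sequence is used here as example input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as a short example input.
fragment = clean_sequence(
    "ATGACCAACG GAAACCGTGC TGTCCTGAGC ATGCTGGACG AGATCCGCAC CCTGGCCCGG"
)
print(len(fragment), f"{gc_fraction(fragment):.0%}")
```

Passing the full 1332 bp sequence to `gc_fraction` gives the value to compare with the GC content field.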
 
Protein sequence
MTNGNRAVLS MLDEIRTLAR PLTGIRDLDR LVHGAGTGRF VAIGEASHGT HEYYTMRARL 
SMRLIEEQGY SWIGVEGDWP DCWRINRWVR GQSGQDTGVH TMLAGFGRWP TWMWANEEVA
GFLDWLRGWN LERPMEERVG FYGLDVYSLW DSLREIIGWL EENEPDAVPA AMRAWRCFLP
HHEDPHEYAW STRLVPESCE ADVVALLTEV RNRAFALRDH EPRVESDEAF DAVQNAVVAA
NAEHYYRIMV QGSRESWNVR DLHMADTVDR LSAHLGPASK GIIWEHNTHV GDARATDMAR
DGLVNVGQLL RERHGSEGVT LVGFGSYRGT VMAADAWGSP ERVLTVPEAR TGSHEDLLHR
ALGAPALLEF GGDRSGPWLS TWLGHRAIGV VYRPARESGN YVPTRMGGRY DALIWMEQTS
ALRPLHHEAP PSEPEFETEP TGF
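
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the start codons it permits) and that the gene begins with ATG, as it does here. `CODON_TABLE` and `translate` are illustrative names, not a library API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGACCAACG GAAACCGTGC TGTCCTGAGC"))  # MTNGNRAVLS
print(translate("ACCGGATTCT GA"))  # TGF
```

The two outputs match the first ten and the last three residues of the protein sequence above.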