Gene Arth_3133 details

Gene Information

Locus tag: Arth_3133
Symbol:
ID: 4444366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3514981
End bp: 3516369
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 69%
IMG OID: 639690959
Product: isochorismate synthases
Protein accession: YP_832611
Protein GI: 116671678
COG category: [H] Coenzyme transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1169] Isochorismate synthase
TIGRFAM ID: [TIGR00543] isochorismate synthases


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAGTA CGTTCCGCAC CTTGACAGTC CCTCTGGATG GAAGATCATT CTCCGGGGGG 
CTGCCGTCAT TTCTGGTCCG GGACGATGTT CTCTGCTGGA CCCGCCGCGA GGCCGGGCTG
GTGGGCTTCG GCGAGATCGC CCGCTTCACC GCCACAGGGC CCGACCGCTT CCTGGAGGCC
GACATCTGGT GGCGGCACCT GGTCCTCGAG GCGGACGTCA CCGACTCCGT GGAGTGCCCC
GGCACCGGTC CAGTGGCCTT CGGCTCCTTC GCCTTCTCCA AGGTCTCCAC CCACCGCTCA
CGGCTGATTG TTCCGGAAAT CGTTGTGGGG GTCAGGGACG GCCGCGCCTG GCTCACCCAG
TTAACGTTCG ACGACGGCGA ACTCACCGAG GCGGGCGCCC TCGCCGCCCT GCAGCGCTGG
CTCGAGACCG ACCCGCTGGA CGTGGACCCT GTATACGACA CCGAACCCCG GGACACCGCC
GTCGTGCTTT CACCGGGTGC GCGCCCCGGC GGCGGCATGA CCGCAGATCC CGGCGCCGAC
GTCGTACCTA AACTTGAAAC CGGATCGCTC AGCGAGCACG ACTGGATGGC AGCGGTAGCC
GCAGGCGTCG AGGAAATCCG GACCGGCAAG CTGGAGAAGC TGGTGCTGGC CCGGGACATC
GTGGCCACGA TTCCGGAGGG CGTGAACGCG GCGGAAATCC TGCGCCAGCT GGCCGTCCGC
TACCGCGAAT GCTGGACCTA CGGCGTGGAC GGCCTGGTGG GGTCGACCCC GGAAATGCTG
ATCCAGGTCG AGGGGCGGAC CGCCCAGGCC CGCGTCCTGG CCGGAACCCT TGACCGGCGC
GATGCCGAAG GGATGGACGG GCCGCCGCTG GAGTTCGCCG AGCGCGTGCT GGCCGGGTCC
GAGAAGCAGC GGCACGAGCA CGAGATCGCG ATCCAGTCGC TCACCACCCA GCTGGCGCCG
TTTTCCGAGG CCATGAACGC GCACAGCGAA CCGTTCATCC TGGAGCTGCC AAACGTGTGG
CACCTGGCGT CGGACGTGAA GGCCGAACTG ACCGAGGTGG AGGGGCACGT GCCCACGTGC
CTTGCGTTGA TCAACGCGCT CCATCCCACG GCCGCCGTGT GCGGAACGCC CACCACCGTG
GCCGGTGCGC TCATCCGCAA GCTGGAGCAC ATGGACCGCG GTCCCTACGC GGGACCGGTG
GGCTGGCTGG ATGCGGCGGG GAACGGCGAA TGGGGCATCG CGCTGCGCGG CGCCGTCGTC
GAGGCGCCGG ACACGGTGCG GCTCTACGCC GGCTGCGGCA TCGTGGAGGG CTCGCACCCT
GAAGCTGAGC TCGCCGAGAC CTGGGCGAAG TTCCGTCCGA TGCTCGAGTC GCTGGGCATC
AAGAGCTAG
 
Protein sequence
MTSTFRTLTV PLDGRSFSGG LPSFLVRDDV LCWTRREAGL VGFGEIARFT ATGPDRFLEA 
DIWWRHLVLE ADVTDSVECP GTGPVAFGSF AFSKVSTHRS RLIVPEIVVG VRDGRAWLTQ
LTFDDGELTE AGALAALQRW LETDPLDVDP VYDTEPRDTA VVLSPGARPG GGMTADPGAD
VVPKLETGSL SEHDWMAAVA AGVEEIRTGK LEKLVLARDI VATIPEGVNA AEILRQLAVR
YRECWTYGVD GLVGSTPEML IQVEGRTAQA RVLAGTLDRR DAEGMDGPPL EFAERVLAGS
EKQRHEHEIA IQSLTTQLAP FSEAMNAHSE PFILELPNVW HLASDVKAEL TEVEGHVPTC
LALINALHPT AAVCGTPTTV AGALIRKLEH MDRGPYAGPV GWLDAAGNGE WGIALRGAVV
EAPDTVRLYA GCGIVEGSHP EAELAETWAK FRPMLESLGI KS
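
The protein sequence can be reproduced from the gene sequence by translating it codon by codon, and the GC content can be recomputed by counting bases. The following is a minimal sketch rather than the database's own procedure. The function names `clean`, `gc_fraction`, and `translate` are hypothetical. It relies on translation table 11 assigning the same amino acids to codons as the standard code (table 11 differs in which codons may act as starts), and it assumes the input contains only A, C, G, T and whitespace.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the codon-to-amino-acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (input must be non-empty)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence in this record.
first_row = "ATGACGAGTA CGTTCCGCAC CTTGACAGTC CCTCTGGATG GAAGATCATT CTCCGGGGGG"
print(translate(first_row))    # MTSTFRTLTVPLDGRSFSGG
print(translate("AAGAGCTAG"))  # KS
```

The two printed fragments agree with the first 20 and last 2 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.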