Gene Arth_2653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2653 
Symbol 
ID4444774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2976836 
End bp2978218 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content69% 
IMG OID639690473 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_832132 
Protein GI116671199 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000365317 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGGCA CCGCCCCCAC CGAGTCAGCT ACTTCAGGAC CCGTTGCCGA CGTGCCGCAC 
TGGCCGGCAC CGTTCGCCGA AGCGCCCGTC GACGCCACGG TCACCGTCCC GGGCTCCAAG
TCCCTCACCA ACAGGTATCT GGTTCTTGCG GCCCTGGCCG ATGGGCCGTC CCGTCTGCGG
GCACCCCTGC ATTCGCGCGA CTCGGCCCTC ATGATCGAGG CCCTCCGGCA ACTGGGAGCC
GGTATCAGGG AAGTTCATAG CGACGGCGCG TTCGGGCCCG ACCTTGAGGT CACCCCCCTC
CGTGCCGACG CCGCGGCGAC CGATGCCGCC ATCGACTGCG GACTCGCCGG AACAGTCATG
CGCTTTGTTC CGCCGGTGGC TGCGCTCCGC AACGGGGCGA CAGTCTTCGA CGGCGATCCG
CACGCCCGCA AGCGGCCGAT GGGCACCATC ATCGAGGCAC TGGCCGCCCT CGGCGTCGAC
GTCCGCGCTG CGGACGGGAC CCCGCCGTCG GCTCTTCCCT TCACAGTGGC GGGCAGTGGC
CACGTACGGG GCGGCCATCT GGTGATCGAC GCAAGCGCCT CTTCGCAGTT CGTGTCGGCG
CTGCTCCTGG TGGGCGCGCG TTTCACCGAG GGCCTGCACC TTGAGCACGT GGGCAAGCCG
GTCCCCAGCC TGGACCACAT CAACATGACC GTCGCCGTGC TGAGGGAAGT CGGCGTGTCC
GTCGACGATT CCGTCCCGAA TCACTGGGTT GTAGCGCCGG GCCGCATCCG GGCCTTCGAT
CGCCGCATCG AGCAGGACCT GTCGAATGCC GGGCCGTTCC TCGCCGCCGC GCTGGCGACC
CGCGGCACGG TCCGCATTCC CAACTGGCCC TCCCCCACCA CGCAGGTCGG CGACCTTTGG
CGCAGCATCC TGACCGCGAT GGGCGCCACG GTCACGCTGG ACAACGGCAC ACTCACCGTC
ACGGGCGGCC CCGAAATCAC GGGGGCGGAC TTTGCCGACA CCAGCGAACT TGCACCGACA
GTGGCCGCGC TGTGCGCACT GGCCACGGGT CCGTCGCGGC TGACCGGCAT CGCGCACCTC
CGCGGCCACG AAACGGACAG GCTAGCCGCA CTCGTCACGG AGATCAACCG CCTCGGCGGC
GATGCCGAGG AAACTTCCGA CGGCCTGGTG ATCCGGCCCG CGAAGCTCCA TGGCGGCGTC
GTGCACAGCT ACGCGGACCA CCGCATGGCC ACCGCAGGGG CCATCCTGGG CCTCGCCGTT
CCCGGCGTGG AAGTGGAAGA CATCGGCACT ACGTCCAAGA CCATGCCGGA CTTTCCGCAA
CTTTGGGAAT CCATGCTGAC ACAACAGCCG GGCCGGCAGA CGGAACAGGC CCGTGGGGCG
TAG
 
Protein sequence
MTGTAPTESA TSGPVADVPH WPAPFAEAPV DATVTVPGSK SLTNRYLVLA ALADGPSRLR 
APLHSRDSAL MIEALRQLGA GIREVHSDGA FGPDLEVTPL RADAAATDAA IDCGLAGTVM
RFVPPVAALR NGATVFDGDP HARKRPMGTI IEALAALGVD VRAADGTPPS ALPFTVAGSG
HVRGGHLVID ASASSQFVSA LLLVGARFTE GLHLEHVGKP VPSLDHINMT VAVLREVGVS
VDDSVPNHWV VAPGRIRAFD RRIEQDLSNA GPFLAAALAT RGTVRIPNWP SPTTQVGDLW
RSILTAMGAT VTLDNGTLTV TGGPEITGAD FADTSELAPT VAALCALATG PSRLTGIAHL
RGHETDRLAA LVTEINRLGG DAEETSDGLV IRPAKLHGGV VHSYADHRMA TAGAILGLAV
PGVEVEDIGT TSKTMPDFPQ LWESMLTQQP GRQTEQARGA