Gene Dshi_1022 details


Gene Information

Locus tag: Dshi_1022
Symbol: aroA
ID: 5710538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1057354
End bp: 1058706
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 71%
IMG OID: 641266933
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_001532365
Protein GI: 159043571
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
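
The start, end, gene length and protein length fields are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1057354, 1058706)
print(length, protein_length_aa(length))
```

With the values from this record the two functions give 1353 bp and 450 aa, matching the listed Gene Length and Protein Length.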


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.321263
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.615068
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCC ACGGTGACCC GATCCCCATG ACCGCCCATC CGTCCGGACC GCTCAGCGGC 
ACAGCGCAGG TGCCCGGGGA CAAGTCGATC TCGCACCGGT CGCTGATCCT CGGCGCGCTC
GCGGTGGGGG AGACGAAGGT CACCGGGCTG CTCGAAGGGC AGGACGTGCT GGACACCGCG
CGGGCGATGC AGGCGTTCGG GGCGGAGGTG ATCCAGCACG CGCCGGGGGC CTGGTCGGTG
CATGGGGTCG GCACCGGCGG GTTTGCCGAG CCCGAGGATG TGATCGATTG CGGCAATTCG
GGCACCGGCG TGCGGCTGAT CATGGGGGCC ATGGCCACGA CGCCGATCAC CGCGACCTTC
ACCGGCGATG CGAGCTTGCG CAGCCGTCCC ATGGGCCGGA TCACCGACCC GCTGGCGGGG
TTCGGGACCA CCGCCGTGGG CCGCCGGGGC GGGCGCTTGC CCATGACGCT GACCGGGGCC
GCGGACCCGG TGCCGGTGCG CTACACCGTG CCGGTGCCGT CGGCGCAGGT GAAATCCGCC
GTCCTGCTGG CGGGGCTGAA CGCGCCGGGG CAGACCGTGG TGATCGAGGC CGAGGCGACG
CGGGACCATT CGGAGCGGAT GCTGCGCGGC TTTGGCGCCG AAATCAGCGT CGAGAGCGCG
CCCGAGGGCA ATGTCATCAC CCTGACCGGC CAGCCGGAGC TGCGCCCCCA GACCATCGTG
GTGCCGCGCG ATCCCTCTTC GGCCGCCTTC CCGGTGGCGG TGGGGCTGAT CGTGCCCGGC
TCCGACGTGC TGGTGCCGGG TATCGGGCTG AACCCGACCC GCGCGGGGCT CTATACCACC
TTGCAGGAGA TGGGCGCCGA GCTGAGCTTC GAGAATATGC GCGAGGAGGG CGGCGAGCCC
GTCGCGGACC TGCGCGCGCG CTTTAGCGAC GCCATGCAAG GCATTGAAGT GCCACCTGAA
CGCGCGCCTT CCATGATCGA CGAATACCCG ATCCTGAGCG TGATCGCCGC CTATGCCACC
GGGCGCACGG TGATGCGCGG CGTCAAGGAG CTGCGCGTCA AGGAGAGCGA CCGGATCGAC
GCCATGGCGC GCGGGCTGGA GGCCTGCGGT GTGCGGGTGG AGGAGGACGA GGATACCCTG
ATCGTGCACG GGATGGGGCC GGGGGGTGTG CCGGGGGGGG CCACCTGCGC CAGCCATCTC
GACCACCGGA TCGCCATGAG TTTCCTGTGC TGCGGGCTGG CCGCGCAGAC CCCCGTCTCG
GTCGACGATG GCGGCCCGAT CGCCACCAGC TTCCCGATCT TCGAGCCGCT GATGACCGCG
CTGGGCGCGA CCCTCACCCG CGACAGTACC TGA
 
Protein sequence
MSAHGDPIPM TAHPSGPLSG TAQVPGDKSI SHRSLILGAL AVGETKVTGL LEGQDVLDTA 
RAMQAFGAEV IQHAPGAWSV HGVGTGGFAE PEDVIDCGNS GTGVRLIMGA MATTPITATF
TGDASLRSRP MGRITDPLAG FGTTAVGRRG GRLPMTLTGA ADPVPVRYTV PVPSAQVKSA
VLLAGLNAPG QTVVIEAEAT RDHSERMLRG FGAEISVESA PEGNVITLTG QPELRPQTIV
VPRDPSSAAF PVAVGLIVPG SDVLVPGIGL NPTRAGLYTT LQEMGAELSF ENMREEGGEP
VADLRARFSD AMQGIEVPPE RAPSMIDEYP ILSVIAAYAT GRTVMRGVKE LRVKESDRID
AMARGLEACG VRVEEDEDTL IVHGMGPGGV PGGATCASHL DHRIAMSFLC CGLAAQTPVS
VDDGGPIATS FPIFEPLMTA LGATLTRDST
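
The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction and a stop-codon check from the cleaned gene sequence. The helper names are hypothetical, and the stop-codon set assumes the standard stops used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons assumed for translation table 11


def clean_sequence(block: str) -> str:
    """Remove the grouping spaces and line breaks from a pasted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop codon."""
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# Short excerpt only: the first group and the final codon of the gene sequence.
excerpt = clean_sequence("ATGTCCGCCC\nTGA")
print(len(excerpt), round(gc_fraction(excerpt), 2), ends_with_stop(excerpt[-3:]))
```

Pasting the full Gene sequence block into `clean_sequence` and passing the result to `gc_fraction` is one way to cross-check the GC content field listed under Gene Information.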