Gene Franean1_0958 details

Gene Information

Locus tag: Franean1_0958
Symbol:
ID: 5669372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1121846
End bp: 1123225
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 77%
IMG OID: 641239886
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_001505320
Protein GI: 158312812
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
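
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the end coordinate includes the stop codon (the gene sequence on this page ends in TGA). The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a stop codon inside the range.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not code for an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1121846, 1123225)
print(gene_len, protein_len)  # 1380 459
```

Both values match the Gene Length and Protein Length fields of this record.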


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0792374
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0544047
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGCGCG GGCAGGACGA CCCGTGGTCG GCTCCGGTCG CGACCGGTCC GGTCCGGGCG 
ACCGTCACCG TGCCGGGCTC GAAGTCCGGC ACGAACCGGG CGCTCGTGCT GGCCGCGGCG
GCTGACGGGG TCTCCCGCCT GCGCGGGGCG CTGCGCTCCC GGGACACCGT CCTGATGGCC
GCCGCCCTGC GCGAGCTCGG CGCGACGGTG ACCGACGAGG CCGCGCCAGG CGGCGCTGAG
CCGGGCAGGC CGGGTGAGCC GGGCGCGGAC CAGGGCGCCG CCGACATCGT CGTCACCGGC
CCGGTCGGCG CCGTACGGGG CACGGCCGCG ATCGACTGCG GCAACGCGGG AACGGTCGCG
CGCTTCACTC CCGCGCTGGC GACACTGGCC CGCGGGGACG TGCGCTTCGA CGGCGATCCC
CGGATGCGGG ACCGCCCGCT CACCCCGCTG CTGCGCGCGC TGCGCGAGCT GGGCGCCCAC
ATCGACGGCG ACCGGATGCC CTTCACCGTG CGCGGCACCG GGGCGGTCTC CGGCGGGGCG
GTGACGGTCG ACGCGTCGGA TTCCAGCCAG CTCGTCTCCG GCCTGCTGCT CGCCGCGGCG
CGGTTCGAGC GCGGTGCGAC CGTGACCCAC GCCGGTCACC GCCTGCCGTC CGGGCCGTAC
CTCGACATGA CCGTCGCCGA CCTGCGGGCG GCCGGGGTGG TCGTCGACGT CGACGACCCG
ACCGCGGACC TGCTGCGCGC CGGGGGCACT CCGGCCGCGT CGACCCGGCG CTGGCGGGTC
AAGCCCGGCG GGCCGCGGCC GCTGGACCGC GTGATCGAGC CCGACCTCAA CAGCGCCGCC
CCCTTCGTCG CCGCCGCGGC GGTGACCGGC GGCGAGGTGA CGATCACCGG CTGGCCGGCG
TCCACCGAGC AGCCCGGCCG GATGCTGCCG GACCTGCTGG TGGCCATGGG CTGCCGGGCG
GAGCTGGTGC CGGAGGGCCT GCGCGTCACC GGCGGCGGGC GGATCACCGG TATCGACGTC
GATCTCTCCG ACTTCGGCGA GGCGGCGCCG GTGCTGACCG GGCTGGCCGT GCTGGCGGAC
TCGCCGTCCC GGCTGCGGGG CATCGCCCAC CTGCGCCTGC AGGAGACCGA CCGGCTCGCC
GCGCTGGCGA GCGAGCTCGG CCGGCTCGGC GCCCGTGTCA CCGTCACCGA CGACGGCCTG
TCGATCATCC CGGTGCCGCT GCGCGGCGCC CGGCTCGACC CGCACGCGGA CCACCGGCTG
GCGATGACCT ACGCCGTGGT CGGCCTGGCG GTGCCCGGGG TCACCGTCGA CGACATCGCC
ACGACCGGCA AGACGGTCCC CGACTTCGCG CGGATGTGGA CGACGATGCT GGCCGGCTGA
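
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence laid out like the one above. This is a minimal sketch, assuming the blocks of ten bases are separated only by spaces and line breaks. The helper name `gc_content` is hypothetical, and the example input is just the first two blocks of the gene sequence, not the whole gene.

```python
def gc_content(seq):
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First two 10-base blocks of the gene sequence only.
print(gc_content("GTGCAGCGCG GGCAGGACGA"))  # 75.0
```

Passing the full gene sequence to the same function gives the value for the whole gene.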
 
Protein sequence
MQRGQDDPWS APVATGPVRA TVTVPGSKSG TNRALVLAAA ADGVSRLRGA LRSRDTVLMA 
AALRELGATV TDEAAPGGAE PGRPGEPGAD QGAADIVVTG PVGAVRGTAA IDCGNAGTVA
RFTPALATLA RGDVRFDGDP RMRDRPLTPL LRALRELGAH IDGDRMPFTV RGTGAVSGGA
VTVDASDSSQ LVSGLLLAAA RFERGATVTH AGHRLPSGPY LDMTVADLRA AGVVVDVDDP
TADLLRAGGT PAASTRRWRV KPGGPRPLDR VIEPDLNSAA PFVAAAAVTG GEVTITGWPA
STEQPGRMLP DLLVAMGCRA ELVPEGLRVT GGGRITGIDV DLSDFGEAAP VLTGLAVLAD
SPSRLRGIAH LRLQETDRLA ALASELGRLG ARVTVTDDGL SIIPVPLRGA RLDPHADHRL
AMTYAVVGLA VPGVTVDDIA TTGKTVPDFA RMWTTMLAG
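
The protein sequence follows from the gene sequence under translation table 11, which uses the same codon assignments as the standard code but allows alternative start codons. That is why the gene begins with GTG while the protein begins with M. The sketch below is an illustration written without external libraries. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical, and the start codon set is the one listed for table 11 in the NCBI genetic code tables.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11); "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Initiation codons for table 11 (assumed from the NCBI listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq):
    """Translate a CDS, reading an alternative start codon as M."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGCAGCGCG GGCAGGACGA CCCGTGGTCG"))  # MQRGQDDPWS
```

The output matches the first ten residues of the protein sequence above.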