Gene Francci3_3771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3771 
Symbol 
ID3906055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4519584 
End bp4520924 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content73% 
IMG OID637881097 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_482851 
Protein GI86742451 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0729461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGCA TCCTGACGGG TAGCGAAGCC ACCGGAACGC AGTCCGATCC GTGGCCGGCT 
CCACTCGCAC GGCAACCGGT CGAGGCCGTC GTCGCGGTGC CCGGCTCGAA GTCCGGCACA
AACCGGGCAC TCGTGCTCGC CGCGCTGGCG AACGGAACCT CCCGGCTGCG GGGTGCGCTG
CGCTCCCGCG ACACCCTGCT GATGGCCGGG GTGCTGAGGA CGTTGGGTAT CGAAGTATCC
ACCGAGGGTC CCGACTGGGT CGTCCACGGC CACCCGACAC CGAGCGCCGC ACCCACCGCG
CGCGCGGAGT GCGGAAACGC GGGCACCGTG GCGCGCTTCA CCCCGGCACT GGCCACGCTG
ACCCGCGGCG ACGTCGTCTT TGACGGAGAC GCGCGGATGC GCGAGCGTCC GCTCACCCCG
CTGCTCGGCG CGCTGCGCGA GCTCGGCGCC GAGATCGACG GCGACCGAAT GCCCTTCACC
GTGCACGGCC GAGGGCGGCT GCGTGGCGGC GAGGTGATCG TGGATGCCTC CCACTCCAGC
CAGCTTGTCT CGGGGCTGCT CCTCGCCTCC CCGCACTACG ACACCGGGGT GACGGTCGTC
CACCGCGGGT CCCGGCTGCC ATCGGCTCCC TACCTGGACA TGACCGTCGC GGATCTACGG
GCCGCCGGTG CCACGGTCGA GGTCGACGCC GTGTGCTCCC CCGCGGCCGG CCCGGTGGCG
GACACCCGGC GCTGGCGCGT CGAACCGGGC CGGCCGACCG CCGCGGACCG GACCATCGAA
CCGGACATGA ACAGCGCGGC GGCCTTCCTC GCGGCGGCGG TCGCCACCGG CGGGCGGGTG
ATGATCTCGG ACTGGCCGGA GTCGACCGAG CAGCCCGGCC GGCTGCTGCC CGATCTGCTG
GTGGCGATGG GAGGCACGGC CCGGCGCACG TCGGCGGGCC TGGAGATCAC CGGCGCCGGT
GCCGTCCACG GGATCGACGT CGATCTGTCC GACTTCGGGG AAGCCGCTCC GATACTGACC
GCCCTCGCCG TGCTCGCCGA CTCGCCGTCA CGGCTACGGG GCATCGCGCA CCTGCGGCTG
CAGGAGACCG ACCGGCTGGC CGCCCTGGCC CTCGAACTCG GTCGCCTGGG CGCACGGATC
ACCGTCGCCG ACGACGGGCT GGCGATCAGT CCGGCGCCCC TGCACGGCGC CCGGCTCGAC
CCGCACGCCG ATCATCGGCT GGCGATGGCC TACGCGGTGG TCGGCCTCGT GGTGCCCGGG
ATCGTCATCG ACGACATCGC GACGACCGGC AAGACGGTGC CGGACTTCCC CGAGATGTGG
ACGGCGATGC TGGACCAATG A
 
Protein sequence
MVSILTGSEA TGTQSDPWPA PLARQPVEAV VAVPGSKSGT NRALVLAALA NGTSRLRGAL 
RSRDTLLMAG VLRTLGIEVS TEGPDWVVHG HPTPSAAPTA RAECGNAGTV ARFTPALATL
TRGDVVFDGD ARMRERPLTP LLGALRELGA EIDGDRMPFT VHGRGRLRGG EVIVDASHSS
QLVSGLLLAS PHYDTGVTVV HRGSRLPSAP YLDMTVADLR AAGATVEVDA VCSPAAGPVA
DTRRWRVEPG RPTAADRTIE PDMNSAAAFL AAAVATGGRV MISDWPESTE QPGRLLPDLL
VAMGGTARRT SAGLEITGAG AVHGIDVDLS DFGEAAPILT ALAVLADSPS RLRGIAHLRL
QETDRLAALA LELGRLGARI TVADDGLAIS PAPLHGARLD PHADHRLAMA YAVVGLVVPG
IVIDDIATTG KTVPDFPEMW TAMLDQ