Gene Francci3_3211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3211 
Symbol 
ID3906177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3803540 
End bp3804580 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content77% 
IMG OID637880535 
Productshikimate dehydrogenase 
Protein accessionYP_482297 
Protein GI86741897 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0169] Shikimate 5-dehydrogenase 
TIGRFAM ID[TIGR00507] shikimate 5-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.121067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGAGA TCGTTCCCGC CGGCACTACC CCGGCCCGCC GTCGGCCCCC TGAGTGCCGT 
CGTCGGCCCA TGGAACCGCC GGCGACCGTC GCGGCCACCG CGCCGCCGTC GGGCCCCACC
CGCGCCGCCG TGCTCGGGGC ACCGGTGGAC CATTCGCTCT CCCCGCTGCT GCATGCCGCC
GCCTACGCCA AGCTCGGTCT CGCGGTGACG TACACCGCCG TGCACTGCGA CGAGACCGGG
CTCGCCGCGA TGCTCACCCG GGTCCGCACG GATCCGGGCT GGGTCGGGCT GTCCCTGACC
ATGCCGCTCA AGACCGTCGC GCTCGACCTG CTCGACGAGG TGGACGCGAC GGCGGCGGTC
ATCGGCGCCG TCAACACGGT CGTCGTCGGG CCCGCCGGCC GGTTGCGTGG CTACAACACC
GACGTGGACG GCATCGGCAT GGCGCTGCGC CGGGTGATGC GCGGCGCGGT CCCAGGCCAG
CCGCTCGTAC TCGGTGCCGG TGGCACGGCC CGCGCCGCGG TCGCGGCGGT CGCCGCGGCG
GGCTGCACCC GCCTCGGCGT CGTCGCCCGC CGGCCCGCCG CCGTGGCGGA GGTGGCGGAG
ATCGGGTCGC GGCTGGGCGT CGAGGTCACC GCGCTGCCCT GGGAGCTGCT GGCCGCGGGC
CTGCCTGCCG GTCCGGATCT GGTCATCTCC ACCACGCCCG CCGGCGCGAC CGACGGGCTC
GCCACCGGAC CGTGGCCGCC AGCCTGCCAG CTCGTAGAAC TGCTCTATCA TCCCTGGCCC
ACGGCGCTGG CCGCCGCGGC CTACCGGGCC GGTGCCCGGG TCGCAGGTGG CCTGGAGATC
CTCGCCGCCC AGGCCGTGGG GCAGGTCGAG CACTTCACCG GGCAGGTGGT TCCCACCAGC
GTTCTGCTGG CCGCGGGTCA GGCCGCGCTG GACGAGCGGA CGCGGGGGAA CAGGCCCCCG
GCGGTGGAGG TCGGCGTGCC GGGCGGCCAC GGTCTCGCGG GGAGCGGTCC CCGCGGCCGC
GGTGGACCTG CCGGCGGATA G
 
Protein sequence
MPEIVPAGTT PARRRPPECR RRPMEPPATV AATAPPSGPT RAAVLGAPVD HSLSPLLHAA 
AYAKLGLAVT YTAVHCDETG LAAMLTRVRT DPGWVGLSLT MPLKTVALDL LDEVDATAAV
IGAVNTVVVG PAGRLRGYNT DVDGIGMALR RVMRGAVPGQ PLVLGAGGTA RAAVAAVAAA
GCTRLGVVAR RPAAVAEVAE IGSRLGVEVT ALPWELLAAG LPAGPDLVIS TTPAGATDGL
ATGPWPPACQ LVELLYHPWP TALAAAAYRA GARVAGGLEI LAAQAVGQVE HFTGQVVPTS
VLLAAGQAAL DERTRGNRPP AVEVGVPGGH GLAGSGPRGR GGPAGG