Gene Information |
Locus tag | Francci3_3211 |
Symbol | |
ID | 3906177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3803540 |
End bp | 3804580 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 637880535 |
Product | shikimate dehydrogenase |
Protein accession | YP_482297 |
Protein GI | 86741897 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
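The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which the reported gene length implies) and that the CDS ends in a single stop codon; the function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Francci3_3211.
length = gene_length_bp(3803540, 3804580)
print(length)                     # 1041
print(protein_length_aa(length))  # 346
```

Both results agree with the Gene Length and Protein Length fields of the record.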
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.121067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGAGA TCGTTCCCGC CGGCACTACC CCGGCCCGCC GTCGGCCCCC TGAGTGCCGT CGTCGGCCCA TGGAACCGCC GGCGACCGTC GCGGCCACCG CGCCGCCGTC GGGCCCCACC CGCGCCGCCG TGCTCGGGGC ACCGGTGGAC CATTCGCTCT CCCCGCTGCT GCATGCCGCC GCCTACGCCA AGCTCGGTCT CGCGGTGACG TACACCGCCG TGCACTGCGA CGAGACCGGG CTCGCCGCGA TGCTCACCCG GGTCCGCACG GATCCGGGCT GGGTCGGGCT GTCCCTGACC ATGCCGCTCA AGACCGTCGC GCTCGACCTG CTCGACGAGG TGGACGCGAC GGCGGCGGTC ATCGGCGCCG TCAACACGGT CGTCGTCGGG CCCGCCGGCC GGTTGCGTGG CTACAACACC GACGTGGACG GCATCGGCAT GGCGCTGCGC CGGGTGATGC GCGGCGCGGT CCCAGGCCAG CCGCTCGTAC TCGGTGCCGG TGGCACGGCC CGCGCCGCGG TCGCGGCGGT CGCCGCGGCG GGCTGCACCC GCCTCGGCGT CGTCGCCCGC CGGCCCGCCG CCGTGGCGGA GGTGGCGGAG ATCGGGTCGC GGCTGGGCGT CGAGGTCACC GCGCTGCCCT GGGAGCTGCT GGCCGCGGGC CTGCCTGCCG GTCCGGATCT GGTCATCTCC ACCACGCCCG CCGGCGCGAC CGACGGGCTC GCCACCGGAC CGTGGCCGCC AGCCTGCCAG CTCGTAGAAC TGCTCTATCA TCCCTGGCCC ACGGCGCTGG CCGCCGCGGC CTACCGGGCC GGTGCCCGGG TCGCAGGTGG CCTGGAGATC CTCGCCGCCC AGGCCGTGGG GCAGGTCGAG CACTTCACCG GGCAGGTGGT TCCCACCAGC GTTCTGCTGG CCGCGGGTCA GGCCGCGCTG GACGAGCGGA CGCGGGGGAA CAGGCCCCCG GCGGTGGAGG TCGGCGTGCC GGGCGGCCAC GGTCTCGCGG GGAGCGGTCC CCGCGGCCGC GGTGGACCTG CCGGCGGATA G
|
Protein sequence | MPEIVPAGTT PARRRPPECR RRPMEPPATV AATAPPSGPT RAAVLGAPVD HSLSPLLHAA AYAKLGLAVT YTAVHCDETG LAAMLTRVRT DPGWVGLSLT MPLKTVALDL LDEVDATAAV IGAVNTVVVG PAGRLRGYNT DVDGIGMALR RVMRGAVPGQ PLVLGAGGTA RAAVAAVAAA GCTRLGVVAR RPAAVAEVAE IGSRLGVEVT ALPWELLAAG LPAGPDLVIS TTPAGATDGL ATGPWPPACQ LVELLYHPWP TALAAAAYRA GARVAGGLEI LAAQAVGQVE HFTGQVVPTS VLLAAGQAAL DERTRGNRPP AVEVGVPGGH GLAGSGPRGR GGPAGG
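The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and the start-codon set is limited to three common bacterial initiation codons (ATG, GTG, TTG) as a simplifying assumption. Note that the gene begins with GTG, which is read as methionine at the initiation position, matching the leading M of the protein sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group residues in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def has_expected_ends(dna: str) -> bool:
    """Check that a CDS is a whole number of codons with a start and a stop codon.

    Assumption: only ATG, GTG and TTG are accepted as start codons here.
    """
    starts = {"ATG", "GTG", "TTG"}
    stops = {"TAA", "TAG", "TGA"}
    return len(dna) % 3 == 0 and dna[:3] in starts and dna[-3:] in stops


# Toy inputs; paste the full Gene sequence field to check the real record.
first_block = clean_sequence("GTGCCGGAGA")
print(gc_percent(first_block))                         # 70.0
print(has_expected_ends(clean_sequence("GTG CCG TAG")))  # True
```

Applied to the full gene sequence, `gc_percent` can be compared with the GC content field, and `len(clean_sequence(...))` with the Gene Length field.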