Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3164 |
Symbol | |
ID | 5671541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3728070 |
End bp | 3729293 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242059 |
Product | hypothetical protein |
Protein accession | YP_001507479 |
Protein GI | 158314971 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.932938 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCCGGC CGGTGGTGCG GACCGAGCGG CGGCAGGTGT TCGACCTGCC CGAGATGCGT CCGCATGTGG TCGAACACGA GTTGGTCGAG CGGGAGTGCG GCTGCGGGAG GCGGACGCGG GCTGCCGCGC CGGCGGGGGT GGATGCCCCG GTCCAGTACG GGCCGCGGGT CACCGCGGCG GCGGTCTACC TGTACGCGGG CCAGTTCCTG TCCAAGGACC GGACCGCGAC CGCGCTGGCC GAGCTCGTCG GGATCCCGCT GTCCGCCGGC ACGGTCGCGG TGATGACGCG CCGGGCTGCC GCCGGCCTGG ACGGTTTCCT CACCACTGTC CGCGGTCTGC TCGCGGGCAG CGAGGTCCTC GGGGCCGACG AGACCGGGCT GCGGGTCGCC GGGAAGCTGC ACTGGGTCCA CTGCGCCCGC ACCGACAAGT ACACCCTGAT CGACTGCCAT CCGAACCGCG GGAGGGCCGG GATCGACACG CTGGGAGTGC TTCCCGGCTT CGGTGGGGTC GTCGTCCATA ACGCCCGGGC GCCCTATGAC AGCTACACCG ACGCGACCCA CCAGCTGTGT GTCGCTCACG TGCTACGCGA ACTACAGGCC GTCGTCGAAG GCGCCCAGGC CGGGCAGTGG TGCTGGGCCG CCCAGGCCAC CGACGCGCTC GTCGCCCTCC ACACGCAGAC CACCGAAGCA GCCGCCGCAG GCGCGGCCGG CCCAGATCTG GCCGAGCTGG CCGCTCAGAC CCGGCTGCTG CGCCACGCCG CCCATATCGG GATCAGCCAG ACCGCCGACC GGGACACGAA ACTCATGGCG GCCCGCCACG CGCTGGCCTG CCGCCTCGTC GACCGCGAAG CCGACTACCT ACGCTTCACC CGGGACCTGC GGATACCGGC GGACAGCAAC GGCTGCGAGC GCGACATCCG CATGATCAAA CTACGGCAGA AAGTATCCGG GTGCCTACGC ACCCTGACCG GCGCCCGCCA GTTCCGCGCG ATCCGAAGCT ACCTGTCCAC CGTCACCAAA CACGACCTCG GTTCTGTTCC ACGCCCTCGT CCAGCTGGCC GAAGGCCGCC CCTGGACGCC CGCAACAGCC TGACCCCAAA CCAAAGATCA AAAAAGTACC TGACCAGTTA CATCGCCTCG ACCCGAGAGC GTGAGACCGC TACTGCACCC ACCCGATCGG CGGGACGGGT TACCGACTAC CAGCTAAGGT CAACGTCCAC TTAA
|
Protein sequence | MGRPVVRTER RQVFDLPEMR PHVVEHELVE RECGCGRRTR AAAPAGVDAP VQYGPRVTAA AVYLYAGQFL SKDRTATALA ELVGIPLSAG TVAVMTRRAA AGLDGFLTTV RGLLAGSEVL GADETGLRVA GKLHWVHCAR TDKYTLIDCH PNRGRAGIDT LGVLPGFGGV VVHNARAPYD SYTDATHQLC VAHVLRELQA VVEGAQAGQW CWAAQATDAL VALHTQTTEA AAAGAAGPDL AELAAQTRLL RHAAHIGISQ TADRDTKLMA ARHALACRLV DREADYLRFT RDLRIPADSN GCERDIRMIK LRQKVSGCLR TLTGARQFRA IRSYLSTVTK HDLGSVPRPR PAGRRPPLDA RNSLTPNQRS KKYLTSYIAS TRERETATAP TRSAGRVTDY QLRSTST
|
| |