Gene Information |
Locus tag | Franean1_5027 |
Symbol | |
ID | 5673365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6021572 |
End bp | 6022699 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243881 |
Product | hypothetical protein |
Protein accession | YP_001509296 |
Protein GI | 158316788 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
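The coordinate and length fields above are related by simple arithmetic, sketched below. The function names are illustrative and not part of the database. The sketch assumes 1-based, inclusive coordinates and a single stop codon that is counted in the gene length but not in the protein length, which is consistent with the values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(6021572, 6022699)
    print(length)                  # 1128
    print(protein_length(length))  # 375
```

The strand value (-) does not change this arithmetic. It only indicates that the coding sequence is the reverse complement of the replicon sequence between these coordinates.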
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0475457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.219528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCGC CGAAGAAGCC AGCGAGTGCG GCCACCCTGG CAGCGCTGGG GACGCTGTCC GCCGCCACCC GCGGCCGGTG GCGGATCAAG GCTGGGCCGA GCTGGACCGA GCCGCTCGCG CTCTACGTCG CCGCTGTAGC CGACCCCGGC GAACACTCCG GGACCGTCCT GCGTGCTGTC GCCGCCCCAC TACTCGAGGC CGAGCGCCTG GCGCACGCCT CGCGCGCCGA TGACCACCGG AAGGCACTCC GGCGGCACCA CGTCGCCCAG CGCCGGTACG AGACGGCCGT GGAGGCGGCC GCCCAGGCGC CTGCGGCACA GGCGCGGGAG GCCAACGAGC GGGTGATCGA GGCGATCGAG GATCTGGCCC GCCATCCCGA GCCCGATGAC CAGCGGATCA CCGTCGGCCA CGTCGAGCCG AGGGCGCTGC CGGTGCTGCT CGCCGCCGAG GGAGGTGCCC TGGCCCAGCT CGCGACCAGT GGTGGCCTGC TCGCTGCCAT CGCCGACCCA GGTGACGCCG AAGACGGCTG GGTGGCCCTG TTCACGCTCC TGCGGGCCTA CGACGGCCAG GCGCTGTCCA TCGACCGGGA GCCCGACCTG CACGTCCACG TTGAAGAGCC GTTCATGGCT CTCGTTGTGA CCCTCACGCC CGAGGAGATC GCCAGTCCCA GCGCCCGCCA GCACCTGCGG GGCCTGCTGC CGCGGCTGCT GTTCGCCGCG CCCGCACCGA TGGCCGGAAC GCGGACGACG GACTCGCCAG CGATGCCAGA CGAGGTGAAC ACCGTCTGGG CCGGAGCCGT TCGTGGCGTC CTTGACGCCG CGCTGCGCGC CGACGGCATC ACGATGGTGG AACTGGAGCC GGCCGCCCGG GAGATCTTCG AGGTGTTCCG CGCCGGGTGG GAACCGCGAC TGCATCCCGA GACCGGCGAC CTCGCCGACA TCCAGACGTG GGCGACCAAG CATCCGGGGC GGGCCGCGCG CATCGCCGCG CTGCTGGCGC TGGCCGAGGA CCCGGCGACC ACCACGGTGG GCGTGGAGCA TGTGTGGGCC GCCGTGAACC TGGCTGAGGT CCACATGGCG CACGCGCGGG TCGCGCTGAC CGGTGCGGGC GTCGAGGGGG CGGCGTGA
|
Protein sequence | MSSPKKPASA ATLAALGTLS AATRGRWRIK AGPSWTEPLA LYVAAVADPG EHSGTVLRAV AAPLLEAERL AHASRADDHR KALRRHHVAQ RRYETAVEAA AQAPAAQARE ANERVIEAIE DLARHPEPDD QRITVGHVEP RALPVLLAAE GGALAQLATS GGLLAAIADP GDAEDGWVAL FTLLRAYDGQ ALSIDREPDL HVHVEEPFMA LVVTLTPEEI ASPSARQHLR GLLPRLLFAA PAPMAGTRTT DSPAMPDEVN TVWAGAVRGV LDAALRADGI TMVELEPAAR EIFEVFRAGW EPRLHPETGD LADIQTWATK HPGRAARIAA LLALAEDPAT TTVGVEHVWA AVNLAEVHMA HARVALTGAG VEGAA
|
| |
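The GC content and protein sequence fields can be checked against the gene sequence with a short script. This is a minimal sketch using only the Python standard library, and the helper names are hypothetical. It assumes the gene sequence is shown 5' to 3' in coding orientation (it begins with ATG and ends with TGA), and it relies on translation table 11 assigning the same amino acids to codons as the standard code. Table 11 differs mainly in the start codons it permits, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, shared by table 11).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    prefix = "ATGAGCTCGC CGAAGAAGCC AGCGAGTGCG"
    print(translate(prefix))  # MSSPKKPASA
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the listed protein sequence and GC percentage to be compared with the computed values.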