Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4241 |
Symbol | |
ID | 5672596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5049153 |
End bp | 5050283 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641243114 |
Product | hypothetical protein |
Protein accession | YP_001508531 |
Protein GI | 158316023 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.468194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCGACG GACGTCCGGC GCAGGAGGCT CGGCGGGCCG GAGGCCGGGC GGAACGGGCC ATCCGGATCA CCACCGTGAT CGCGGTGGCC ACGGTCGCGG CCGTGGCGGG TTTCGTCTCC TACCGCCACA TGCGCGGCGT CGCCCTGCAA TACGGCGAGG ACGCGATGAC CTCGGCCGTT CTCCCGTTCA GCGTGGACGG GCTGATCGTG GCCGCGTCGA TGACGATGCT GGCCGACCGG CGGGCCGGCC GGCGGCGTTC CTGGCTGTCC TACACACTGC TGATGCTGGG GGCGTGCGCG TCACTGGCGG CGAACGTCCT GCACGCCGAG CCGACCACCG CAGCCCGGAT CATCGCCGGC TGGCCTCCGC TGGCGCTGCT CGGCTCGTAC GAGCTGCTCA TGCGCCAGAT CCACCCGACC AGCCGGCGGA CAGCCCGCCA GCAGGCCGCC GCCGCCGCGG CGCCGGCCGA CGTCCCGGCT GTCGCTCCCC AGGCTGATGG CCCAGCTGCC GGTGGGCAGC CAGGCGGTGG GCAGCCGGTT CCGGCGCCGG GGTCCATTCC GCCGCAGCGG CAGCGGATGC CCGCCGTCGG CGGCGACTTC CCGGCTGGCA CCCCCGCTCC CACCGCGACA GCTCCCGCCG CAGCCGACAA CGCTGTGGCC GACAACGCGG TGGCGGTGGT CGACCTCTCG GTGACCGCAC CCATGGTCAC GCCCGCCCCC GCGACGGCCC AGGCCCTCGA CCCGGCACCC GGGCGGACGC CTGCGCCGGG CCCGGCCGTC CCGACAGGTG CCTCGACAAC CGTCCCGACA GGCGGGCCGG CGTCCACGGC CGGGGCCGGC TCCGCCGGCG GAGTGGACAC CGCGGACTCC TCGGTGAAGC GTGAGGCGAT CATTCGCGCG CTGGACGAGA CCGGCGGTTC GGCGACCGCG GCGGTCACCC TGCTCGGCCG GTGGGGCATC ACGGTGAGCA AGAGCTGGGT GTACCAGGTG CGCAAGGAGA CCCGGCACGC CGACGTGCAG ACCGGGCCGC TGGTCATGCC CACGCACCGT GCCCACCCCG CCAGCCGTGG GCGGCGGCGC GGCATGGCGC CCGACCGGCC GCTCGTGGCA CCGACGACGA CCAGGGGCTG A
|
Protein sequence | MVDGRPAQEA RRAGGRAERA IRITTVIAVA TVAAVAGFVS YRHMRGVALQ YGEDAMTSAV LPFSVDGLIV AASMTMLADR RAGRRRSWLS YTLLMLGACA SLAANVLHAE PTTAARIIAG WPPLALLGSY ELLMRQIHPT SRRTARQQAA AAAAPADVPA VAPQADGPAA GGQPGGGQPV PAPGSIPPQR QRMPAVGGDF PAGTPAPTAT APAAADNAVA DNAVAVVDLS VTAPMVTPAP ATAQALDPAP GRTPAPGPAV PTGASTTVPT GGPASTAGAG SAGGVDTADS SVKREAIIRA LDETGGSATA AVTLLGRWGI TVSKSWVYQV RKETRHADVQ TGPLVMPTHR AHPASRGRRR GMAPDRPLVA PTTTRG
|
| |