Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4807 |
Symbol | |
ID | 5673148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5739241 |
End bp | 5740593 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641243663 |
Product | hypothetical protein |
Protein accession | YP_001509079 |
Protein GI | 158316571 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.280649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.472222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATCGCT CGGCGCCGGC ACCGCGGGCG GCCCGACCGA GGCGACCGGG ACCTTCGACC ACCTCCTCGT GCGGGGCGAG GCTCGCTCGG CCGGCGCGGC CGGCCAGGTG GCGCGGCCGG CTGCTCGCCG CGAAGGCCGT CGTGGTCGGC GCCGTCGCGT TCGTGACCGG CCTGGTCGCC GCCGCCGTCG CCGTCGTCTT CGGCCAGCGC GTGCTGCGCG GCAACGGCGT CTACGTCCAC CCCGCGACGA CGCCGACCGA GCTGCGCGTG ATCGTCGGGA CCGCCGCGCT GCTCGCCGTC GCCGCGGTCC TGGCGCTCGG GCTGGGGACG TTGCTGCGGC GCGGTGTCAC CGCGGTGGCG ATCGCCGTCG CCGTGATCGT CCTGCCGTAT CTGCTGGCCA TGACCGTCCT GCCGGACGGG GCCGCGGTGG CTGCTGCGGG TGAGCCCGGC GGCGGCGTTC GCGCTGCAGC AGACGGCGAC GCAGTACCCG CAGGTCGCCA ACCTCTACAC GCCGGCGAAC GGGTACTTCC CCCTCGCCCC GTGGGCCGGC TTCGGGGTGC TCGCCGGGTG GGCCGCCCTC GCCCTGGGCA CGGCCGCCGT CCTTCTCCGG CGGAGGAGCG CGTGAGATCG GCCCTGCACG CCGAGTGGAC CAAGCTGCGG ACCTCGCCCG GCACGCTCGG GCTGGCGCTC GCCGTGATCG TGAGCACGGT CGGGTCGAGC GCCGCGGTGG CCGCGGCGAC CGGGTGCGCG CCAGGAGGCT GTGGGCAGGA CCTGACGAGA CTGAGCCTCA CCGGGGTCCA GGTCGGTCAG GCCGTCGTCG CCGTCCTCGC GGTCCTGGTG ATCGGCGACG AGTACAGCAC CGGGATGGTC CGGGTCACGC TCACCGCGCT GCCCCTGCGG ACGACTGTCC TGGCCGCCAA GGCCGTCGTC GTCGCCGGGG TCGTCGCGGT GACGGCCGTG CCCGCCGTCC TCGGGTCCCT GACCGTCGGG TGGTTCATCC TTCCCGAGCA GGAGGTCGTC CCCCGGGCGG CCGTCGGTTC CGTGCTGTAC CTCGTCCTCA TCGGCCTGTT GGGCCTGGGA ACGGCCACCG CCGCGCGGAA CCCGGCGGCT GCGTCCGGGA TCGTCCTGGG ACTGCTGTAC GTGTTCCCGA TCATCGCCCA GGTGGTCACC GACCCGGGCT GGCGGCGGCA CCTGCAGCAG GCCGGGCCGA TGAGCGCCGG GCTCGCCGTC CAGGCCACCG GCGACGTCGA CGCCGTGCCG ATCGGACCGT GGCAGGGACT CGGCGTGCTC ACGCTGTGGG CGCTGGCCGC GCTCCTCACC GGCGGCCTGC TCCTCGCCCG GCATGACGCC TGA
|
Protein sequence | MDRSAPAPRA ARPRRPGPST TSSCGARLAR PARPARWRGR LLAAKAVVVG AVAFVTGLVA AAVAVVFGQR VLRGNGVYVH PATTPTELRV IVGTAALLAV AAVLALGLGT LLRRGVTAVA IAVAVIVLPY LLAMTVLPDG AAVAAAGEPG GGVRAAADGD AVPAGRQPLH AGERVLPPRP VGRLRGARRV GRPRPGHGRR PSPAEERVRS ALHAEWTKLR TSPGTLGLAL AVIVSTVGSS AAVAAATGCA PGGCGQDLTR LSLTGVQVGQ AVVAVLAVLV IGDEYSTGMV RVTLTALPLR TTVLAAKAVV VAGVVAVTAV PAVLGSLTVG WFILPEQEVV PRAAVGSVLY LVLIGLLGLG TATAARNPAA ASGIVLGLLY VFPIIAQVVT DPGWRRHLQQ AGPMSAGLAV QATGDVDAVP IGPWQGLGVL TLWALAALLT GGLLLARHDA
|
| |