Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6609 |
Symbol | |
ID | 5674924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8043082 |
End bp | 8044014 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641245460 |
Product | NmrA family protein |
Protein accession | YP_001510852 |
Protein GI | 158318344 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0702] Predicted nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTGA TCCTGGTAAC CGGTGCCACC GGCACCATAG GCGGCAAGGT ACTCGACATC CTGGCGGCTC GCGGCCAGCG TGTCCGGGCT GTGACCCGTG ATCCGAGAAA ACTACCTACG CGTCCGGGAG TGGAGGCTGT CCGGGCCGAT TTTGACGAAC CCGCGTCGCT CCGGCAGGCG GTGGCCACGG TCCAGGCAAT GTTCCTGCTC ACCGTGCTGG CCTCTCCCAC TCCGCGCCAC GACCTGGCCG TGCTCGACGC GGCACGATCG GCCGGAGTGC GCAGGGTGGT GAAACTTTCC GCCATCGGCA CTGGCGAGAA GATCGGCCCC GACGTGGTCG GGGCGTGGCA TCTGGTGGCC GAGCGAGCGG TGCGAGACAG CGGTATGGGG TGGACCGTGC TGCGCCCATC CAGCTTCGCG TCGAACACCC TGCAATGGGT CGACGCGATC GGCAAGGGAC AGCCCGTGCC CGATCTCACC GGAGCAGGCC GGCAGGGAGT CGTCGACCCC TACGATGTCG CCGCCGTCGC GGTCGAGGCC CTCCTGTCGC CAGCACACGT CGGCAAGATC TACACACTGA CGGGACCAGG GCTGCTGACC GTGGCCGAAC AGGCCGACTG TCTGTCGCGG GTGACGGGAC GCCGGGTCGA CACCGTCGCC GTGAGCCTCG AGCAGGCAGC CGATCAGATG CTCGCCGCCG GGATGGACAG GTCCACCGTC GAGGTGATCA TTACCGGCTC CGCCTGGGCG CGCGCCGGCC ACAACGCCGT CCTGACCGAC GACGTGGCCG CGATCCTGAA CCGTCCCGCC ACCAGTTTCC AGAGCTGGGC CTACCGGCAC GCCGTGCCTT CGGCCCCCGC AGGGCCGCCG CGGCGCCCGC AGCCAGACCC ACGACGCCGC CGATGGCCCA GCCTCGGGTG GCTGCGACAC TGA
|
Protein sequence | MAVILVTGAT GTIGGKVLDI LAARGQRVRA VTRDPRKLPT RPGVEAVRAD FDEPASLRQA VATVQAMFLL TVLASPTPRH DLAVLDAARS AGVRRVVKLS AIGTGEKIGP DVVGAWHLVA ERAVRDSGMG WTVLRPSSFA SNTLQWVDAI GKGQPVPDLT GAGRQGVVDP YDVAAVAVEA LLSPAHVGKI YTLTGPGLLT VAEQADCLSR VTGRRVDTVA VSLEQAADQM LAAGMDRSTV EVIITGSAWA RAGHNAVLTD DVAAILNRPA TSFQSWAYRH AVPSAPAGPP RRPQPDPRRR RWPSLGWLRH
|
| |