Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1468 |
Symbol | |
ID | 5669872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1763336 |
End bp | 1764256 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641240388 |
Product | NmrA family protein |
Protein accession | YP_001505814 |
Protein GI | 158313306 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0702] Predicted nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.524569 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGCC AGAGGATCGT GGTGGTGGGT GCGACCGGCT TGCAGGGACG CGCTGTGACG GCACATCTGC TTGCTGCTGG CTGGCGGGTG CGAGCGATGA CCCGCGATCC AGGCGGCGCG CCCGCGAGGG CACTCGCGGC GGAGGGGGCG GAAATCGTAC GCGGTGAGAT GGACGACATC GACTCCCTGA CCGCGGCGAT GCACGGCGCC TACGGTGTGT TCAGCGTCCA GCCGACCGTA GGATCGGTCG GCACACCGCC GGACTTCACC GCGGCCGACG AGATCCGGTG GGGCGGCAAC GTGGCACAGG CAGCGCAGAC TACCGGCGTC CGGTTCTTCC TCTACGCCTC CGTCGCCGCG GCAGGTCGAC ACGAGACCGA AGTGCTGCCG CAGGCACTGG TGAGCAAATG GCACATCGAG CAGCGCATCG CCGGACTCGG CTTGCCTGCG GCGGTTCTGC GGCCCGCTGC CTTCATGGAG AACTACAGCG CCGGGTACTA CTTGCGCGAC AATGCCGTTA CCGCGCCCTT CGCCGCCGAC GTCCCACAGC AGGTCATTGC CGTCGACGAC GTCGCGGCCT TGGCGGCGTC GGCCTTTGCC CGACCGCAGG AATGGATCGG CCGGGCAATC GACGTGGCGG GTGACGAACT GACGCCGGTG CAGGTCACGA CTGCCATCTG CGAGGCGATC AGGCAGCCAC TGCCGTATGT ACAGACCCCG ATCGAGACAA TCCGGCAGGT CAGCGAGGAA CTCGCGTACG CCGTGGAGTG GCAGAACGAA CGCGGCTGGC GCGCCGACAT CCTCACCACG CGACAGATCC ACCCCGACCT GATGGACTTC CGCACCTGGC TCGCCGAGTC CGGCGGAACT CAGATCAGCA CATTCCTCGC CACCCAGCGT ACTGGACAAC AGGACACATG A
|
Protein sequence | MSGQRIVVVG ATGLQGRAVT AHLLAAGWRV RAMTRDPGGA PARALAAEGA EIVRGEMDDI DSLTAAMHGA YGVFSVQPTV GSVGTPPDFT AADEIRWGGN VAQAAQTTGV RFFLYASVAA AGRHETEVLP QALVSKWHIE QRIAGLGLPA AVLRPAAFME NYSAGYYLRD NAVTAPFAAD VPQQVIAVDD VAALAASAFA RPQEWIGRAI DVAGDELTPV QVTTAICEAI RQPLPYVQTP IETIRQVSEE LAYAVEWQNE RGWRADILTT RQIHPDLMDF RTWLAESGGT QISTFLATQR TGQQDT
|
| |