Gene Information |
Locus tag | Franean1_0152 |
Symbol | |
ID | 5668577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 179868 |
End bp | 180809 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641239081 |
Product | hypothetical protein |
Protein accession | YP_001504525 |
Protein GI | 158312017 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0775] Nucleoside phosphorylase |
TIGRFAM ID | [TIGR03664] futalosine nucleosidase |
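
The length fields above follow from the coordinates. A minimal sketch, assuming the start and end positions are 1-based and inclusive (consistent with the 942 bp listed) and that the final codon is a stop codon. The function names are hypothetical and do not belong to any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(179868, 180809)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 942 313
```

The same check applies to either strand, because the length depends only on the coordinate span.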
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00531151 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCGCA CCCTCATCGT CACCGCGGTG AACGCCGAGG CGGACGCGGT GTGCGCCGGT CTGATGGGGA GTTGGTCGCG CGGTCGCTCG CGCACTCCGT CCGGGCCCGG CGGCCTGGTC TCGCCCGACT GGGACCGGCA GTTGCTGACC ACCTCGACCC TCGGGCCCTA TGTCGGGCGC CGGCGCGGTC CGGTGACCGT TCTCGCCGCC GGCCCGGCTG GTCCGGCGTC CGCGGCCGGG ACGGCCGTCG CCCTGGCGGT CGCGAGCGCG GCCGGCGAGC CGTTCGGGCT GGTCATCTCG ATGGGGATCG CGGGCGGTTT CCGCGGCTGG GCTGAGGTCG GCGACCTGGT GATCGCCGAT CGGATGGTCG CCGCGGACCT CGGCGCCGAG TCCGGACTCG ACGAGCCGGC CGAGCCGGCA GCCCCGGTCC GGATCAACGA GATGGACCGC AACCTGGGCT TCCACCCCGC CGGGGCACCC CCGTACCAGC CCGGCCCGCC GCCGGGCCCG GTGGCGGCGG GCGCGTTCCT GACCATGGAC GACCTGGGCC TCGGCTCGTC GACCCTGCAG CCCGACGCCG ACCTCGTCCG GCGGCTGACG ACCGTTCTCG ACATCACCGG GCGCAGAATC GTGAGTGGCA CCGTCCTCAC TGTGTCGACC GTGACCGGGA CGGAGCGGCG GGCCGCGGCG CTCACAGCAC GGCTGGACCC GGTCGCCGAG GCGATGGAGG GGTACGCCGT CGGGATTGCC GCGGGCGCGT TCGGTGTTCC GGTGACGGAG ATCCGCGCGG TCAGCAACCC GGTCGGCCGG CGAGACCGCG CGGCGTGGAA CACGCAGGCC GCGCTGGGCA GCCTGCGCAC CGCGGCGGCG GCGCTCGTGG CACACCCGGA GGGCCTGGTA CCCGCCCCCC GCGCGGAGAA GGAGTCGGGG CCGACGGGCT AG
|
Protein sequence | MTRTLIVTAV NAEADAVCAG LMGSWSRGRS RTPSGPGGLV SPDWDRQLLT TSTLGPYVGR RRGPVTVLAA GPAGPASAAG TAVALAVASA AGEPFGLVIS MGIAGGFRGW AEVGDLVIAD RMVAADLGAE SGLDEPAEPA APVRINEMDR NLGFHPAGAP PYQPGPPPGP VAAGAFLTMD DLGLGSSTLQ PDADLVRRLT TVLDITGRRI VSGTVLTVST VTGTERRAAA LTARLDPVAE AMEGYAVGIA AGAFGVPVTE IRAVSNPVGR RDRAAWNTQA ALGSLRTAAA ALVAHPEGLV PAPRAEKESG PTG
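
The gene sequence is listed in coding orientation (it starts with ATG and ends with the stop codon TAG) even though the gene lies on the minus strand, so it can be translated directly. A minimal sketch, assuming the standard amino acid assignments, which translation table 11 shares, and using hypothetical helper names. Only the first 30 bases of the record are used, to keep the example short:

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments;
# translation table 11 differs from the standard table only in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


prefix = "ATGACCCGCA CCCTCATCGT CACCGCGGTG"  # first 30 bases of the gene sequence
print(translate(prefix))  # MTRTLIVTAV
```

Running `translate` and `gc_fraction` on the full gene sequence gives values that can be compared against the protein sequence, the 313 aa protein length, and the 76% GC content listed in the record.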