Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1778 |
Symbol | |
ID | 5670180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2135861 |
End bp | 2136772 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240699 |
Product | hypothetical protein |
Protein accession | YP_001506122 |
Protein GI | 158313614 |
COG category | [R] General function prediction only |
COG ID | [COG1090] Predicted nucleoside-diphosphate sugar epimerase |
TIGRFAM ID | [TIGR01777] conserved hypothetical protein TIGR01777 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00178055 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.185841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTCG CCGTCACCGG CTCGTCCGGG CTGATCGGTT CGGCGTTGCT GCCCGCGCTG CGCGGGGACG GCCACGAGGT CGTCACCCTC GTCCGGCGCC CGCCGCGCGC CCCGTCCGAG ATCCGCTGGG ACCCGGCGGC CGGCACGCTG GACGCCGCCG CCCTGGCCGG CGTGGACGGC GTCGTGAACC TGGCCGGCGC CGGCATCGGC GACCGCCGGT GGACCGCCGC CTACAAGCAG ACCCTTCGGA CCAGCCGCAT CGACGGCACC CGCCTGCTCG CCGAGGCCCT CGCCGGCCTC GACCCGCGCC CGCGGGTCCT GCTCTCCGGC AGCGCCATCG GCTGGTACGG CACGAACGCC GGCTCGGCCG GCGCCGCGCT GGACGAGACG GCCCCGCCCG GCACCGGCTT CCTCGCCGAG CTCGCCCGTG ACTGGGAGAA CGCGACCACA GCGGCGCAGG AGGCCGGCAT CCGGGTCGTC CGGGTGCGCA CCGGCATCGT CCTCTCCGGG CGCGGTGGGA CCCTCCAACG GCTGCTCCCG ATCTTCCGGC GCGGAGCCGG CGGCCGGCTG GGCTCGGGGC GCCAGTGGCT GAGCTGGATC AGCCTGGCCG ACACCGTCGA CGCGCTGTGC TTCCTCCTCG AGGCCGACGG AGTACGCGGG CCGGTCAACC TGGTGGCGCC CACCCCGGTG ACGAACGCGG AGTTCACGTC GGCGCTGGCG CGGACGCTAC GGCGCCCGGC CTTCGCCCAG GTACCGCGCT TCGCACTACG CCTGGCCCTG GGCGAGTTCG CCGACGAGGG ACCACTCGCC TCCCAGCGGC TCGCGCCGGC CACGCTGGTC GACGCCGGGT TCCGGTTCAA CCACTCCGAC CTCGCCACCG CGCTGGCCGA CGCCGTCCAC CGCGACGCCT GA
|
Protein sequence | MKVAVTGSSG LIGSALLPAL RGDGHEVVTL VRRPPRAPSE IRWDPAAGTL DAAALAGVDG VVNLAGAGIG DRRWTAAYKQ TLRTSRIDGT RLLAEALAGL DPRPRVLLSG SAIGWYGTNA GSAGAALDET APPGTGFLAE LARDWENATT AAQEAGIRVV RVRTGIVLSG RGGTLQRLLP IFRRGAGGRL GSGRQWLSWI SLADTVDALC FLLEADGVRG PVNLVAPTPV TNAEFTSALA RTLRRPAFAQ VPRFALRLAL GEFADEGPLA SQRLAPATLV DAGFRFNHSD LATALADAVH RDA
|
| |