Gene Information |
Locus tag | Franean1_4622 |
Symbol | |
ID | 5672967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5511758 |
End bp | 5512786 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641243483 |
Product | hypothetical protein |
Protein accession | YP_001508899 |
Protein GI | 158316391 |
COG category | |
COG ID | |
TIGRFAM ID | |
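The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the function names are hypothetical, chosen for illustration):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 5511758, End bp 5512786.
gene_len = gene_length_bp(5511758, 5512786)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)  # 1029 342
```

Both results match the Gene Length (1029 bp) and Protein Length (342 aa) fields in the record.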
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATCC GCCTGCTCTA TCTGATCTTC GTGCGGGTGT GCGGGTGTGC GGGTGGCTGG TTCTCCTCGG CCGTTCGTCG GCGTCCAAGA ACATCGAGCT GCTCGTGCTG CGGCACGAGG TCTGAGGTGC TGCGCCGTAC CCAGCCCAAG CCCCAGTGGG ACTGGGCGGA CCGGGCGGTC CTCGCCGCAC TGATCCAGCT CCTACCCAAG ACGCTGCGAG CACACCGGCT GGTCACCCCC GGCACCGCCC TGCGGTGGCA TCGTCCTGCG GTGGCACCGT CGTCTGATCA CACGGAAATG GACCCACCCG CAGCGGACGG GACGACCACC GGTCAGCACG GAGATCGCGA CCCTCATCGA ACGGTTCGCG ACCGAACACG ACATAGGGCT ACACGCCAGA CCGACACGAC GTGGCGACAG TTCCCGCGCA CGCAGGCATC GACCATGCTG GCCGTGGACT TCCTCAACCT CCTCATGGAC CTCGGCGACC GGACGGCCGA CTTCCAGTTC CTGGTCCGCG ACCGCGCCGG ACAGCCCACC ACATCCTTCG ACGCGGCCCT CGCCGATGCC GGTATCGACG CGGTCACGAT CCCACCCCGG ACTCCGCAGG CGAACACCTA CGCCGAACAG TTCGTCCGCA CAGTCCGAAC GGAAGTCACC GACCGGATGA GGATCTCCGG TGCACGGCAC CTGCGCACCG TCCTGACCGA GTACGCACGG CACTACAACG GACGACGCCC ACACCGCGCC CTCCAGCTCC AACCGCCCTG GCCGACCACC CATCGCCGAC CTCACCCAGA AACGAATCAA ACGCCAGCCT GTCCTCGGCG GCCTGATCAA CGAATACGAA CGAGCCGCCT AAAACCCCAG GTCATCCCTG AAGGGGTGGG GCCGGTCAAG CGGTTCCGGT GTGGGGGTTG GGCCTGTTTT CCGGCCTGGT TCTGGGTGCC TGTGGGCGTG CGCCAGTGTC CGGCGGCGGA GTCAAGACGC AGCGCCCGCA GGGCGGAGCC CGAGGACGAA CGGTCTTGA
|
Protein sequence | MSIRLLYLIF VRVCGCAGGW FSSAVRRRPR TSSCSCCGTR SEVLRRTQPK PQWDWADRAV LAALIQLLPK TLRAHRLVTP GTALRWHRPA VAPSSDHTEM DPPAADGTTT GQHGDRDPHR TVRDRTRHRA TRQTDTTWRQ FPRTQASTML AVDFLNLLMD LGDRTADFQF LVRDRAGQPT TSFDAALADA GIDAVTIPPR TPQANTYAEQ FVRTVRTEVT DRMRISGARH LRTVLTEYAR HYNGRRPHRA LQLQPPWPTT HRRPHPETNQ TPACPRRPDQ RIRTSRLKPQ VIPEGVGPVK RFRCGGWACF PAWFWVPVGV RQCPAAESRR SARRAEPEDE RS
|
| |