Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1972 |
Symbol | |
ID | 5670373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2369471 |
End bp | 2370520 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240893 |
Product | hypothetical protein |
Protein accession | YP_001506315 |
Protein GI | 158313807 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.551311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGACA CCGCGGTACG CCTGACGGAG GACGACGCCC GCGCGATCGT GGCGCAGGCC ACGCTCGCGC CGTCGATCCA CAACACCCAG CCCTGGCGGT GGCGCTTCGA CGCCGACGGT CTGCACCTGT TCGCTGATCC CGAGCGGTTG CTGCACGTCG TCGACCCGGA GGGACGCCAG CTCCTGGTCA GCTGCGGGGC CGCCCTGACC TTCGCCCGGC TCGCCGCGCG CTCGCGCGGC CTGCTCCCCG TCGTGGCCCT CGGCCCGCTC GACGACGGCG CCCCGCCGTC CGCGGACGTC CCGCTGGCCA CGATCGCCGT AGCGGACCGC CGCCCGCCCG CGCCGGCCGA GGCGGAGCTG GCGGCTGCGA TGTCCAACCG GCACACCGAC CGCCGTCCGT TCCTGACAGG GGAGCGGGGG CGGCTCGGGG CCGACGACCT CGCCGCGCTG CGCCGCGCCG CCGAGGCGGA GTCGGCCTGG GTGCGGTTCG TGGAGAGCGC GGACGCCCGG GTCGAGACGT CGGTCCTGCT CTCCCGCGCG GACTGGCAGG AGGCCCACGA TCCCGCCTAC ACCGAGGAGC TGCGGCACTG GAGCCGTACC TCGCCGCAGG CACGCGACGG CATCCCCCGC GAGGCCGTCG TCGGTGGCGC GGCGACGCGC CAGTCCGAGT TCGTCCTGCG GGACTTCGAC GTGGTCGGCG GCCTCGAACC GGCCGGCGCG GCCGGTTCGT CCGAGCCGGC GGTGGAGCAG CCGGCGGTGG AGCGGCCGAC GGTCGTCGCC ATAGGCACCG ACTCGGATCG GCCCACCGAC CGGCTGCTCG CCGGCGGCGC GACGGGCCGG GTGCTGCTTA CGGCGACGGC GCGCGGGCTC GCCGCCTCAC CGCTGGGGCA GGTGCTGGAC GTCTCCGCCA TCCGCGAGCT GATGCGCTCG GCGACCGGGG GCATCGGTCA CGTGCAGATG CTGCTGCGGC TGGGTCGCCC CGACCCCGAC CAGCCGCCGC TGGCGGCCAC CCCCCGGCGG CCGGTCGAGG AGATCCTCGA CATCGCCTGA
|
Protein sequence | MVDTAVRLTE DDARAIVAQA TLAPSIHNTQ PWRWRFDADG LHLFADPERL LHVVDPEGRQ LLVSCGAALT FARLAARSRG LLPVVALGPL DDGAPPSADV PLATIAVADR RPPAPAEAEL AAAMSNRHTD RRPFLTGERG RLGADDLAAL RRAAEAESAW VRFVESADAR VETSVLLSRA DWQEAHDPAY TEELRHWSRT SPQARDGIPR EAVVGGAATR QSEFVLRDFD VVGGLEPAGA AGSSEPAVEQ PAVERPTVVA IGTDSDRPTD RLLAGGATGR VLLTATARGL AASPLGQVLD VSAIRELMRS ATGGIGHVQM LLRLGRPDPD QPPLAATPRR PVEEILDIA
|
| |