Gene Information |
Locus tag | Franean1_7196 |
Symbol | |
ID | 5675497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8787956 |
End bp | 8789509 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641246033 |
Product | hypothetical protein |
Protein accession | YP_001511421 |
Protein GI | 158318913 |
COG category | |
COG ID | |
TIGRFAM ID | |
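The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 8787956
end_bp = 8789509

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1554 bp) and Protein Length (517 aa).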
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.594351 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCCAGG TCTGCTGCCT AGCCATTGTC ATCGTGGCCG GGAACGGCCC GTTCCTCAGC GCCTGGGCGA CCGACAAGTA CGACACCTGG CGCTCGCACC AGCCGGACTA CATGCGTCAG TACGGCCGCT GGGACATCGT CGCCGACCCC GGTTCGCGCT CCATCCACGC GGCGCTGCTG CGGACCGGGA AGATTCTGCT GCTGGCCGGT TCGGGAAACA ACCAGGCCAA CTTCGACGCC AAGCGCTTCG AGACCCTGCT GTGGGACCCG ACCGCCAACA CCTTCGAGCA GGTCTACACG CCGTGGGACG TCTTCTGCGC CGGCCAGGCG TTTCTTCCGA GCGGTGAGCT GCTCATCGCC GGCGGCACGA AGAAGTACGA GGTCCTCGCC CAGGACTCGC CGGACGGCAA GAAGCAGGAG TACCAGGGGC TGAAGGACAG CTACACGTTC AATCCGCAGA CGGAGCGGTA CGAGAAGACC GGCGACCTGA ACTTCGCCAG GTGGTACCCG ACCCTGGTCA CGCTGGCCAG CGGCCAGGTG GTCGCGGTCT CCGGCCTGAA CGAGAAGGGC GACATCGACC CAGGCAACAC GGAGTGGTTC GACCAGGCGA ACCGGACGTG GAACCACAAC GAGGGGCTGG TCAAGGAGTT CCCGACCTAC CCGTCGCTGC TGCTCGCCGG GGACGGCCGG CTGTTCTTCT CCGGCGCGAA CGCCGGTTAC GGGCCGGCCT CGCTGGAAGC CCGCCAGCCC GGATTCTGGA GCCTCGCCGA CGGTACGTTC CAGGCGGTTC CGGGCCTGCC TCAGCCGGAG ATCAACGAGA CCGCCGGAAC GGTGATGCTG CCGCCGGCCC AGGAACAGCG GGTGATGTTC GTCGCCGGCG GCGGCGTCGG CGACACCCAG GTGGCCACCG CCCGCACCGC GATCGTCGAC CTGGACGACC CGAATCCCCA TTACGTCCCG GGGCCGAACA CCACTGTCGC GAAGCGGTAC CCCGGCGTCG TCGTCCTGCC GGACGACACC GTGCTCGTCT CCGGCGGCTC CACCGCCTAC CGCCAGAAGG ACACCCAGAC GGCGGAGATC TACCACCCGG ACACGAACAC CTTCACCACA GCCGCCGACC CGCTCGTCGG CCGGGACTAC CACTCGAGCT ATCTACTGAT GCCCGACGGC CGGGTGGCGG TTTTCGGTTC CAATCCGCTG AGCGACGACA ACTTCTTCGA GACCCGGATC GAGATCTACA GTCCGCCCTA CATGTACCAG GGTGAGCGGC CGGTCATTAA GACCGCGCCG ACCTCGGTCA CCCGTGGCAC CACAATCGAT CTGGGCGTCT CCCAGGAGGT TTCCAAGGTC CGCCTGATCC GTCCGGGCGC CTACACGCAC GTGACCGACA CCGAGCAGCG CTCGGTGGCC CTCCCCCTGG TCAGCCAGGC GAACGGCAAG GTAACGGTGA GTGTTCCCGA CAACGCCAAC CTGCTGCCGC CGGACTGGTA CATGCTCTTC GTCGACAACG GCGAGAACAT TCCGTCCGTC GCCACCTGGG TGCAGGTCCA GTGA
|
Protein sequence | MVQVCCLAIV IVAGNGPFLS AWATDKYDTW RSHQPDYMRQ YGRWDIVADP GSRSIHAALL RTGKILLLAG SGNNQANFDA KRFETLLWDP TANTFEQVYT PWDVFCAGQA FLPSGELLIA GGTKKYEVLA QDSPDGKKQE YQGLKDSYTF NPQTERYEKT GDLNFARWYP TLVTLASGQV VAVSGLNEKG DIDPGNTEWF DQANRTWNHN EGLVKEFPTY PSLLLAGDGR LFFSGANAGY GPASLEARQP GFWSLADGTF QAVPGLPQPE INETAGTVML PPAQEQRVMF VAGGGVGDTQ VATARTAIVD LDDPNPHYVP GPNTTVAKRY PGVVVLPDDT VLVSGGSTAY RQKDTQTAEI YHPDTNTFTT AADPLVGRDY HSSYLLMPDG RVAVFGSNPL SDDNFFETRI EIYSPPYMYQ GERPVIKTAP TSVTRGTTID LGVSQEVSKV RLIRPGAYTH VTDTEQRSVA LPLVSQANGK VTVSVPDNAN LLPPDWYMLF VDNGENIPSV ATWVQVQ