Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0341 |
Symbol | |
ID | 5668765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 409243 |
End bp | 410526 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641239273 |
Product | hypothetical protein |
Protein accession | YP_001504713 |
Protein GI | 158312205 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.156991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00506023 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGTCTC AGGTGAGCAT CGATCTCAGG CCGGGCGTAC CGAAGGAAGG CGGGACCCGT GCGGTGGCGG CGAAGCCGAG GAGGCGGCGG CGGAAGGATC GTACCGGGTT CACGGCGGTG GCCCGGGCCG GTCGGCGCCG CGCCGCGGGA TTTCTGCTGG CGGTCGTTGT TCTAGTCGGC GCCAGTGCCT GCCAGGCCGA ATACAAGCCA CCGTTCGCAC CCATCGTGTT CGCCATCGAC GAGAACGGCA ACATCGACGT CAGCGCGAGT CGTGACCTGG TGACCCCGAT CGGCACCTTC ACGATCAGCG AATCGGTTGC ATTGCCGCGG GACGTCCCAT CGGACCGAAC CCTGATGATC CTTCGTCACA AGGTCGCCGG TGAGCTGAAG GATGCTTGGT TCACGCTACT GGCCGCGGTG AACCTGCACT TCTCCGTGGA CGGCGCGAAC CGACTCGTCC CGCAGGCCGA GGAGAACGTC GCGCTGTTGG AGGTGACCGG CCCGGCGACC GGGGTGGTCG CCGAAGCGAC AGACGACGAC TCCGGTGACC GTTTCGAGTC CGAGCAGGTC GAGGTACTGC CGGAGGTTCC CGAGGACACG GACCCCTCGG CCACACCGGA CGACGGCGGG CCGAGCGGAG GGCCGAGCGG CGGGTCGGGT GAGCCGGGCA TCGAGGTGAC ACCGGAGACG CTGAGTTGCG ATGACTCGGG GTGCGCCGGG ACCGTGACCG TGGAGAGCAC CGGCACCGGC ACGCTGCGGG TCACCTCGAC CGAGATCATC GGTCCTGACG CCGACGCGTT CTCCGTGGAC GCGGGCTGCG AGGCCGAGCT GCCTCCCGGC GGGCAGTGCA CTCTCAGCGT CGGCTACGTG CCGCTGGACA GCGGCGAGGC GGCCGCCGCG ACGCTGGTCA TCCACCAGAA TCTCAGCGGT CCCGCCAACG AGGTGAACCT GGAGGGGAGC GCCGGGACGA CGCCGCCCGG CCCGGAGCCC GGCATCGCCG TGACGCCGGA GACGGTGTTC TGCACCAGCT CCGCCTGCCA GCCGGTGACG GTCGAGAGCA CCGGCGACGA CCCGCTGGCC GTCACGTCCG TCGAGATCGT GGGTTCAGGC GCCGCCGCGT TCAGCTACAC GAGCGACTGC GAGGGCGCGT CCCTGCCGAC AGGCGCCCGG TGCGTCGTCA CCCCCGAGTA CACGCCGCAG GGCGGCTCCG ATGCGACGGC CACCCTGGTC ATCCACCACA ATCTCGCCGG GCCCGCGACC GAGGTCACCC TCTCTGCCTC CTGA
|
Protein sequence | MTSQVSIDLR PGVPKEGGTR AVAAKPRRRR RKDRTGFTAV ARAGRRRAAG FLLAVVVLVG ASACQAEYKP PFAPIVFAID ENGNIDVSAS RDLVTPIGTF TISESVALPR DVPSDRTLMI LRHKVAGELK DAWFTLLAAV NLHFSVDGAN RLVPQAEENV ALLEVTGPAT GVVAEATDDD SGDRFESEQV EVLPEVPEDT DPSATPDDGG PSGGPSGGSG EPGIEVTPET LSCDDSGCAG TVTVESTGTG TLRVTSTEII GPDADAFSVD AGCEAELPPG GQCTLSVGYV PLDSGEAAAA TLVIHQNLSG PANEVNLEGS AGTTPPGPEP GIAVTPETVF CTSSACQPVT VESTGDDPLA VTSVEIVGSG AAAFSYTSDC EGASLPTGAR CVVTPEYTPQ GGSDATATLV IHHNLAGPAT EVTLSAS
|
| |