Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4248 |
Symbol | |
ID | 5672603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5061441 |
End bp | 5062445 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243121 |
Product | hypothetical protein |
Protein accession | YP_001508538 |
Protein GI | 158316030 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.897652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGCCC AAAGCCCGCG AGACAGCCGA TCGCAGGGAG CGGACGTCGA ACTGGTCCGG CTTGGGCCGG CGGCAACACG GGTGCGGCTG CCAGCCCTGG CACCTGGTGC GAGCTGGACG TGTGACGTAC GCATCCACGC GCTGCAACGT GGCGTGCACC GGCTGAGCCC CGCGACGCTG CGCCGGGGCG ACCGGTTCGG TCTACTGCTC CGATCCGAGC CCACGGGTGG TGGAGAGGCC GTCGTCCGGG TCTATCCGGA GATCCTTGAC CTGCAGACGC CGACCTCCGA CCTCGGGATG ATCGAGAACG GGGTGGCGCG GCTACTCGAT CATGGGACGG AGTTCCACGG GCTTCGCGCC TATGTACAGG GCGACGACCT CCGCCAGGTC CATGCCGCCT CCAGCGCCCG CACCGGGCGG TTGCTTGTCC GGGAGAACGC TGACGACTCG GTCTCGTCGA CCTCCTGGGT CCTGTTCGAC GATCGTCGGA GTTCCTACCC GGCTGATGCT GATGGTGATC AGGCGTATGA GGACGGACTC ACCTGCGCCG TGTCCATTGC CGCCAGCGCG GTGCGCGCTG GCTCTGGCGG TGCCCTCGGC ACGCTGTCCG GCCGTACGGT GACGCACTAC GGGGACCCCG CTGATCACTG CCGGATTCTC GACATGGCGT CGGAAGCGGA TCCGGTGGTC ACCGGAGACC CCGCGGCCGG CAGCCGGGTC GCGCGGTTCC TCGCCACAGT GCGCGAGAGC GCCACGGTGT GGACCGCGGG GACAGCGGTG GTGGTGACCG GAACGGAGAG CCCGGACGCC CTCGATGTGG CAGCCGGCCT GACTGGTCTG TGTGCGCCGG TCATCGTCGT GCGGGTTGGA CGTGGGCCGA CCTCGGTGCG GATGACTCCT GAACCAGGCC TGCTCATTCT CTCCGTCGGC GACACCGAAG AAGTCGTCCG GGCATGGCCG CGGCTGTTGG GCGCGCTGAC CGGGCAGGCG CGGTGGGTTG GGTGA
|
Protein sequence | MSAQSPRDSR SQGADVELVR LGPAATRVRL PALAPGASWT CDVRIHALQR GVHRLSPATL RRGDRFGLLL RSEPTGGGEA VVRVYPEILD LQTPTSDLGM IENGVARLLD HGTEFHGLRA YVQGDDLRQV HAASSARTGR LLVRENADDS VSSTSWVLFD DRRSSYPADA DGDQAYEDGL TCAVSIAASA VRAGSGGALG TLSGRTVTHY GDPADHCRIL DMASEADPVV TGDPAAGSRV ARFLATVRES ATVWTAGTAV VVTGTESPDA LDVAAGLTGL CAPVIVVRVG RGPTSVRMTP EPGLLILSVG DTEEVVRAWP RLLGALTGQA RWVG
|
| |