Gene Information |
Locus tag | Franean1_6385 |
Symbol | |
ID | 5674701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7752231 |
End bp | 7753937 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641245234 |
Product | stress protein |
Protein accession | YP_001510629 |
Protein GI | 158318121 |
COG category | [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG2310] Uncharacterized proteins involved in stress response, homologs of TerZ and putative cAMP-binding protein CABP1; [COG4110] Uncharacterized protein involved in stress response |
TIGRFAM ID | |
|
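The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 7752231
end_bp = 7753937
gene_length_bp = 1707
protein_length_aa = 568

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # True
print(remainder == 0)                             # True
print(expected_protein_aa == protein_length_aa)   # True
```

Under these assumptions the three fields agree: 1707 bp is 569 codons, which gives 568 residues plus one stop codon.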
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0413348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.258333 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGTCA CCCTGCCCAG GGGCGGCAAC GTGCTGCTGT CCCGGTCCGC GCCGGGCGTG GCGCGGGTGC GCGTCGCGTT CGGCTGGAGC GAGGCGCCCG GCTCGGCCGT CGACGTGGAC GGCGTCATCG CCCTCGTCGG CGGCGGCGGC GCGCCCACCC GGCTGCTCCT CGCCCACCAG GTGCCGAACC CCGCGGAACG GCTCGGCTCC CTGGCACCGC CCCGCCCGAC CAGCGGCGAG GCCGAGGCCG TCGAGGACGT CGTCGTCACC CTGGCCGCGG TGCCCGCCGC CGTCAGCCGG CTGCAGTTCG GGGCTGCGAT CTACGACTCG GCCGGCCGCG GGCAGACGTT CCGGTCGGTG CCCCGCGGCC ACATCCGGGT GCTCGACGAC GCCGACGGCC GGGAGATCGT CAGCTACGCC TTCGATGTGG AGACCGGCCT TGAGACCGCC CTGATCTTCG GCGAGCTGTA CCGGCACCCG ACGGGGTGGA AGTTCCGGGC GGTCGGCCAG GGGTACGCGG GCGGCCTGCC CGGCCTCGCC GGCGCGAACG GAGCTGGTGC GAACGGAGCT GGTGCGAACG GGGCCGGTGC GAACAGTGCC GGCGCGAATG CCGCCGTCGC GAACGGAGGC GGTGTGCCGG CGGCGCGGCC CGCCGACGTG GCCCCGTTCC TCACCCGGAC CTCTCGGGCC CGGAGCCGCC GCAAGATCGC CGATCACCTG CACCCGGCGC ATCTCGCCCA CCCAACCGGC CCGTCCACAC CGGCTCGGCC GGCACCGACC TCACAGTCGG GGCCGCAGCC ACCCGCGCCG GGAGCACGGC CTACTCCGGC GCCGACTTCA CCGCCGCCAT TCCCGCCTTC ATCGCCACAG CGGCCGGCAC CACAGCGGCC GGCACCACAG CGGCCGGCAC CACCACGGCC GGCACCACCA CGGCCGGCCC CGCCGTCATC TCCACCGGCG GCCGGCGGCG CGCGGTCGCC GCTGGACCTC GGCGAGGACG ACGAGGACAC CGCGCGGCCG GAAACGCCGG CGGCCGGCGT CCCGGAGCGC CGCCCGTCGA CCCCGGCCGA GGTCGGCGAG CGGTCCTCGC GCCACCGGCA GCGCAACGAA CACGTCAGCG CGCTCGACGA CGATCACCCG GCCACCATCT GGACGGCGGA GCAGCGCGGC TCCGGCGCCC TGACCGTGAC CCTGCGCTGG GAGACCCTGA CGACCCGCAC GGGGCTGCCG CGCCCGAGCA ACATCTACCT CGGCTGCCTC TGGCAGGCAC TGGACGGCGC CGCCGGCGTC ATCCAGCACC TGGGGCACTC CGCCAGCAGC GCCGGGCGCG CGGGCCGGCA GGTGCTGCGC CTGGGCAGCC GCGACGAGCG GGACGGCCAG ACGATTTTCG TCGACCTGAG CGCCCTCGCC ACGTTCAAGC GGTTCTTCGT CTTCGCCTAC GGGCTGCACA GCGCTCCGGA GTGGGCGTCG CTGCGGCCGG TGCTCACCGT CGCCGGGCGG TCGGGCGAAC AGCTCGCCAT CCGCCCCGGC GGGGCTTCGC CAAGTGCCCG AACCTGCGTT GTCGCCTCTT TTCACATGGT AGGCGACGAC CTGGTCATCC GGCGGGAGAA CGATTTCGTC GAGGGGACCC AGGCCGACGC CGCGGCGCGG TACGGCTGGT CGCTGGAATG GAATCCAGAC GGCATGACAC CGCGCGATAC TCCGTAG
|
Protein sequence | MTVTLPRGGN VLLSRSAPGV ARVRVAFGWS EAPGSAVDVD GVIALVGGGG APTRLLLAHQ VPNPAERLGS LAPPRPTSGE AEAVEDVVVT LAAVPAAVSR LQFGAAIYDS AGRGQTFRSV PRGHIRVLDD ADGREIVSYA FDVETGLETA LIFGELYRHP TGWKFRAVGQ GYAGGLPGLA GANGAGANGA GANGAGANSA GANAAVANGG GVPAARPADV APFLTRTSRA RSRRKIADHL HPAHLAHPTG PSTPARPAPT SQSGPQPPAP GARPTPAPTS PPPFPPSSPQ RPAPQRPAPQ RPAPPRPAPP RPAPPSSPPA AGGARSPLDL GEDDEDTARP ETPAAGVPER RPSTPAEVGE RSSRHRQRNE HVSALDDDHP ATIWTAEQRG SGALTVTLRW ETLTTRTGLP RPSNIYLGCL WQALDGAAGV IQHLGHSASS AGRAGRQVLR LGSRDERDGQ TIFVDLSALA TFKRFFVFAY GLHSAPEWAS LRPVLTVAGR SGEQLAIRPG GASPSARTCV VASFHMVGDD LVIRRENDFV EGTQADAAAR YGWSLEWNPD GMTPRDTP
|
| |
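The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a nucleotide sequence with the codon assignments of NCBI translation table 11 (the table named in the Gene Information section). It assumes the gene sequence is listed in coding orientation, which is consistent with it beginning with GTG and ending with TAG even though the record gives the strand as "-". The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the start-codon set is the commonly cited one for table 11, stated here as an assumption.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon such as GTG is read as methionine at position 0.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "GTGACCGTCA CCCTGCCCAG GGGCGGCAAC"
print(translate(prefix))  # MTVTLPRGGN
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.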