Gene Information |
Locus tag | Franean1_6317 |
Symbol | |
ID | 5674636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7670206 |
End bp | 7671864 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641245170 |
Product | hypothetical protein |
Protein accession | YP_001510565 |
Protein GI | 158318057 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACA CGTTGACGCG CGACGTCGTT CGCCTATTTG AGCTTCAAAA TGTCACTCTC CGCGAAGTTC GGATCAAGAA TTATCCGGAC GAGCGAAACA TCCTGGTTTA CGTCGAAGAC GGCCAGTATT CCAAGGCGCT GGAAATAGCT GAATCGATTG AACGGCAGCT GTCAAACAAT ACGTCGACAC TGATAGTGGT GCGTCGTGCT CAGCAGGCAG AACGACCATC TACAGTTACC GTCGATGTTC GCGACTCGGC GGCCCTCGAT TTTCAGAGAA TTGTAGCAGC TCGAAATCGT GTGTCCGAGG TGCAACCGAG CCTCAGCTAC GTGAAAGACT ATGCGGAGAA CCTTGCTACT ATAGCCGCGC AGCGGCACCA TCTCGTATTT GGGCGTCGAG GTGTCGGCAA GAGCACACTG CTGGTCGAAG CCAAAAGAGT AGTCGAGAAC GAGGGTTCCC TAACAACCTG GATAAATCTC CAGACTCTTC GACGCGAGAC ACCAATTCGG ATATTTCTCC GAATTGCGCG CGAGATCGTC GGAAATCTCC TGGCGGAACT TTCTAATGTG CGAGCCGACT CCGCGGTGAC AGCCGAGGCC TCACGACTTT ATGACACTCT CGCCACGCTC ATATCCAACG ACGCGACAGA AGAAAATAGC GGGACGCGGG TAATTCCAGA GATTCAGCGG GTTCTTCGGC GTGGCCTCGC CCTAATCGAC AAAGACATGT ATGTTTTTGT CGATGATTTT TATTACGTTT CCCGCGACGA GCAGCCTGAG ATGCTCGATA TTCTTCACGG GTGCGTTCGG GATTGTCGAG CCTGGCTGAA GATAGCATCG ATTAAGAATC TGACAAAATG GTGGACACCC TCCCCTCCTA GAGGTCTTCA AACTGGCCAG GATGCAGACC TAATCAGCCT AGATATAACT CTTCAGGATC CAGGGAGATT GCGCAAGCAT CTCACGACGA TACTTGAACA GTATTGCCTG GAAGTCGGTA TATCGAGGCC CACCCGCCTG TTTAGGCCAG ACTCCCTCGA CAGGCTACTA TTCGCATCAG GCGGAGTGCC CAGAGATTTT CTAGTTCTTG CCTCCGCGGC AGTTGGAAAG GCGCGCGAGA GGGAGAGCGC GAAGGTCGTC GGCGTCCAAG ATGTCAATCG GGCTGCAGGG GACGCCGCCG ACGCAAAGAT CCAGGAACTA GAAGAGGACT TGGCGTCCAA TATCGGACTT TCTGGGCGCA CACTAGAGGC TCTAGGAATT ATTCGTCGAT TCTGCCTAGA CGAAGTTGCG TCCACTTTTT ACGGGGTCGA CTTCTTCGAT AAAGAAACGA ATGCCAGGGA GTATCAAGTC TTCAGCGATC TCCTTGATCT GAGGCTCCTT CATGTCATTC ATGAAAGTGT TTCTGACGCC CACGAGGCGG GCAGAAAGCA TGAGGTTTTC ATGTTGGACC TAAGTCAGTA CACCGGGTCG CGCCTCAAAC AGGGCTTGCG TGTTTTGGAT CTACAAGGAG ACTCCCTTCT CGTGAAAAGG ACGCGGGTAA AGGATAGTGC GAGGAATGTG TCCGACTCCA AAAGCCTAAT CTCGGTGCTT CGGACTTCTC CGGGCTTTCC GCTCTCTCGT TTCAGTCAAT TGGTATCGCG CGACGGAACG GCCACCTAG
|
Protein sequence | MSDTLTRDVV RLFELQNVTL REVRIKNYPD ERNILVYVED GQYSKALEIA ESIERQLSNN TSTLIVVRRA QQAERPSTVT VDVRDSAALD FQRIVAARNR VSEVQPSLSY VKDYAENLAT IAAQRHHLVF GRRGVGKSTL LVEAKRVVEN EGSLTTWINL QTLRRETPIR IFLRIAREIV GNLLAELSNV RADSAVTAEA SRLYDTLATL ISNDATEENS GTRVIPEIQR VLRRGLALID KDMYVFVDDF YYVSRDEQPE MLDILHGCVR DCRAWLKIAS IKNLTKWWTP SPPRGLQTGQ DADLISLDIT LQDPGRLRKH LTTILEQYCL EVGISRPTRL FRPDSLDRLL FASGGVPRDF LVLASAAVGK ARERESAKVV GVQDVNRAAG DAADAKIQEL EEDLASNIGL SGRTLEALGI IRRFCLDEVA STFYGVDFFD KETNAREYQV FSDLLDLRLL HVIHESVSDA HEAGRKHEVF MLDLSQYTGS RLKQGLRVLD LQGDSLLVKR TRVKDSARNV SDSKSLISVL RTSPGFPLSR FSQLVSRDGT AT