Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2309 |
Symbol | |
ID | 5670707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2757748 |
End bp | 2759346 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241228 |
Product | hypothetical protein |
Protein accession | YP_001506649 |
Protein GI | 158314141 |
COG category | [S] Function unknown [T] Signal transduction mechanisms |
COG ID | [COG2013] Uncharacterized conserved protein [COG2310] Uncharacterized proteins involved in stress response, homologs of TerZ and putative cAMP-binding protein CABP1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.739249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAACGC AGTTCAGTCG AGGGCAGAAG TCCCAACTGT CGGCAATCAC CGCTGGCACC GATCTGTACA TCGGGATCCA GATCAACGCG CCGGGCGAAT GGGACGTGTC CTGTTTCGGG CTGGACGGCG CCGGCCGGCT CTCCGACGAC CGGTACTTCA TCTTCTTCAA CCAGCCGAAC TCGCCCGAGT CCTCGATCCA GCTCCTCGGC GCCCAGTCCG GTGACACCCA GTCGTTCCGG GTGACGCTCG ACAAGGTCCC GGACGCGATC CAGAAGCTGT CGTTCTGCGC GGCGCTGGAC GGCCCGGGCA GCGCCGCGCA GATCACGTCC GGTTACCTGC GCATCGTGGC CGGTGGTACG GAGGTGCTCC GCTACGCGTT CACCGGCGCC GACTTCTCCG ACGAGCGCGC CGTCATGATC GGTGACGTCT ACCGCAAGGG CGTCTGGCGC GTCGCGGCGG TCGGCCAGGG GTTCCGGGGT GGCCTGGCGG AGCTGATCCG CAGCTACGGC GGCGAGGTCG CCGACGAGCC CGAGCCCACG CCCGCTCCGG CGGCGCCTGG GTTCGGCGCG CCGCCCGCGC CCGCCTTCAG CCAGCAGAAG CCGCCTGCCG CGCCGGGCTT CGGCGCCCCG CCCGGGCCCC CGCCCCCGCC CCCGCCGCCC CCGCCGGCAC CGCCGGCACC GGCGCATCCC GGCCAGGGCT ACGGTCAGCC GCAGCCGGCC TACGGCCAGG CCGGCGGCGC GCAGCAGGGC TACGCCCAGC CGGGCTACGC TCAGCCACAG CCCCCGCCGG CGATGCCCGG CGCCCAGGGG TACGGCGCGC CCGGCCAGCA GCCACCCGGC CAGCCGATGC CGGGCCACCC GATGCCCGGC CAGCAGCAGC CCGGTCAGCC GATGCCGGGC GGGTTCGGGC CCACGGAGGT CCTGCCGTCG CAGGCCCGCC CCGTCCAGCC CGGTGCCATG AACAGCCTCA ACCCGTACCG CGAGGTGCCG ACCGCGGGGC GCTGGACCCA GCAGAACAGC AAGCTGGTCA AGGTCACCCT GGGCCCGGAG GCGCTGGCGC TGCGCGGTTC GATGGTCGCC TACCAGGGCA ACGTCGAGTT CGACTACAAG AGCGGCGGGA TCCGCGGGCT GATCGAGGAG AAGCTCACCG GCCAGGGCCT CAAGCTCATG ACGTGCAAGG GCAACGGCGA GGTCTTCCTT GCCCAGGACG CCTCCGACCT GCACATCGTC GAGCTCGGCA ACCAGTCGCT GTGCATCAAC TCGAAGAACC TGCTGGCGAT GGACGCCACC GTGCGCTCGG AGGTCCGCCG CATCGAGAGC CCCGGCATCC CCGGCGGCGG CTTCTTCCAC TTCGAGGTCT CCGGGCCCGG GTCGGTCGTC GTGATGACCA AGGGCACGCC GATGACCCTC AACGTCGCCG GTCCCACGTT CGCCGACATG AACGCGCTGG TGGCGTGGAC GTCCGGCATG CGGGTGAGCG TGTCCACCCA GGTCCGCATC TCCCGCCAGA TCTACGCGGG AGCCAGCGGC GAGTCGTTCG CGTTGCAGTT CATGGGGTTC GCCGGCCACT TCGTCGTCGT CCAGCCGTAT GAGGTCTGA
|
Protein sequence | MATQFSRGQK SQLSAITAGT DLYIGIQINA PGEWDVSCFG LDGAGRLSDD RYFIFFNQPN SPESSIQLLG AQSGDTQSFR VTLDKVPDAI QKLSFCAALD GPGSAAQITS GYLRIVAGGT EVLRYAFTGA DFSDERAVMI GDVYRKGVWR VAAVGQGFRG GLAELIRSYG GEVADEPEPT PAPAAPGFGA PPAPAFSQQK PPAAPGFGAP PGPPPPPPPP PPAPPAPAHP GQGYGQPQPA YGQAGGAQQG YAQPGYAQPQ PPPAMPGAQG YGAPGQQPPG QPMPGHPMPG QQQPGQPMPG GFGPTEVLPS QARPVQPGAM NSLNPYREVP TAGRWTQQNS KLVKVTLGPE ALALRGSMVA YQGNVEFDYK SGGIRGLIEE KLTGQGLKLM TCKGNGEVFL AQDASDLHIV ELGNQSLCIN SKNLLAMDAT VRSEVRRIES PGIPGGGFFH FEVSGPGSVV VMTKGTPMTL NVAGPTFADM NALVAWTSGM RVSVSTQVRI SRQIYAGASG ESFALQFMGF AGHFVVVQPY EV
|
| |