Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5531 |
Symbol | |
ID | 5673861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6699468 |
End bp | 6700640 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641244387 |
Product | hypothetical protein |
Protein accession | YP_001509791 |
Protein GI | 158317283 |
COG category | [S] Function unknown |
COG ID | [COG1479] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGAAA TCTACGACGA TAGTACTGAC TGGTTGGATT CCGAGGAAGG CCTTGACGAA GATGGCCAGA TTGATGAGTA TGATCTTACT TCTACGCCCA ACGATTTCAA CGTTCTGACC ATACAAAGTT TCATCGAATC CGGGTCGGTA AAGATTCCGG GATTTCAGCG AAACTACGTC TGGGACCTTA AGCGGGCGTC GAAACTAATC GAGTCGATTG TCATTGGCCT TCCGGTTCCT CAGGTATTTC TCTACGAGGA GGGGCGAAAT TCGTTTCTGG TGATCGATGG CCAGCAGCGT CTAATGACAA TCTATTACTT CGTAAGGCAG CGATTTCCAA GAAAAGAGAA AAGGGCCCAA ATTCGCAAGG TCTTTAATGA GCATGGACGC GTTCCAGATG AAGTGCTTTA CGATGAAGAA CTATTTGAAA ATTTCAACCT AAGGCTGCCC GAAGTATCGC CGGGAAAGCC AAATAAATTC TCTGGACTCA ACTATGCCCG CCTAGGCGAC TATCAAACCC AATTCAATCT TCGTACAATT CGAAATGTCA TCATAAAGCA AGTCTCGCCA GACAATGACA ATTCTTCGAT ATATGAAATA TTCAACCGCC TGAATACCGG AGGAATAAAT CTCACTCCGC AGGAGATTCG GGCGAGCCTG TATCACTCGA AGTTCTACGA CATGCTATTT CGAGTGAATA TGAATCCAGT ATGGAGGAAC ATCGTCGGTC CCGGCGAGCC CGACCTCCAC GTGCGTGACC TCGAGGTGCT GTTGCGAGCG GTAGCTATGT GGGAGCAGGG CGGCAACTAT ACGCCGTCGA TGGTGAGATT CCTGAACAAT TATTCGAATC GCGCGAAATC TTATTCGCCG AGCGAAGTTG AGCAGGTAGA GGAAACTCTG ACCTGGTTTT TTGATCTAGC GAGGAACATT CCGCGGTCAG CCTTCACTTC CCGTGCAAAC AACAAATTCA GCACACCGTT GTTCGAGTCT GTATTTGCTG CTGTATGTAA CAGAAAAAGC CCAAGGAAAG ATGTTTCGCT AGAAGCGTCG ACGGTCGAGG CAATCAAAAA TGACCCCGAT TTTCTTCGCT ATACTACCCA GAGAACCACT GATACCTCCA ACGTAAAGGG TCGCCTAAGT GTAGCGATGA AATACGTTGA CCGCTATGTC TGA
|
Protein sequence | MVEIYDDSTD WLDSEEGLDE DGQIDEYDLT STPNDFNVLT IQSFIESGSV KIPGFQRNYV WDLKRASKLI ESIVIGLPVP QVFLYEEGRN SFLVIDGQQR LMTIYYFVRQ RFPRKEKRAQ IRKVFNEHGR VPDEVLYDEE LFENFNLRLP EVSPGKPNKF SGLNYARLGD YQTQFNLRTI RNVIIKQVSP DNDNSSIYEI FNRLNTGGIN LTPQEIRASL YHSKFYDMLF RVNMNPVWRN IVGPGEPDLH VRDLEVLLRA VAMWEQGGNY TPSMVRFLNN YSNRAKSYSP SEVEQVEETL TWFFDLARNI PRSAFTSRAN NKFSTPLFES VFAAVCNRKS PRKDVSLEAS TVEAIKNDPD FLRYTTQRTT DTSNVKGRLS VAMKYVDRYV
|
| |