Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3979 |
Symbol | |
ID | 5672340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4765455 |
End bp | 4766675 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641242858 |
Product | hypothetical protein |
Protein accession | YP_001508275 |
Protein GI | 158315767 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.98735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.25715 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATCTCC CGCTGGCTGC CGGGCCTGGC GCCGAGAGAC GGGAGAGACC TGGTGCTGTT CGTCGGGACG ACTGGGCGGA GGACCACCAC GACGTGGATG TGCAGGACGA GACAGGCAGG CGGCTGACGA AGGCCCGGCT GCCCGAGGGC ACGGCCGGGA TCGCCCGGCT GCACACGCTG CCCTCCGCCG GAGCGGTATC CCGTCCACTC CCGGGGATCT CGCCTTCGAG CGGCTCACCC CGACCGGCAT GGCGATCACC GCCGCCACCT GCCAGGCGCC CTGTCCCAAG GCAGTTGACC TCATCGACCG ATGTGGACGC GTCGACCTGC CTATATTATG AGTATGTGAC CCGATCTGTA CCGGAAAAGA CGCTAGAGCA TTGGGCCAGC CAGTACATCA CATACCGGTA CAGGTCGAAG GCCGCCCTCT GGTGGCCCGC GAGGGGGGAG GACATCCAGT TCGGTCGACT TCCGTCAAGA CCCGGAAAGA TCGTACAGAT CGAGCTGAAG ACCACTGAAG TCGTCCGTCG CGGCGTCCAC GAGGTGGAGG TCGACCTCGG ACAGCTCTGG GAGTATCGGC GTCTACCGTT GGGAAAGCAA CCATTCTACG CCTTCCCCCG GCCGGGTGCC GACTGGCCCG GAGATCTCGG CGAGGCCGCC GCCAAGACAG GCCGTGCCGT CTCCGAACTC GGATACCGCA GGGGTAAGGA ATTGTGGTTC GCGAACTGGA TGGTCGTGAT GACCACCGAG CAGGTCGCGG ACGTCATGAG CAAGGAGCTT GCACTGCACG GTTCGGAGAA ACGAGGCGAG CGGCGGCCGC TGGTGCGTTT CGCCGGCAAA TCCGAACCGA GATGGGGACT TGAAGCGGCC GATCCGGAGG TGATCCGCTG GCGAGACTTC TGGTCGATTC TCGACCGGTG CGGTCGGGAT CGCTGGCCAC AGTTGATTCG CCTGCCCGCC GTCTTCCTTG ACATCCGCGA CGGTCGGGAG CTTTCCGGCG GTCGAGAGGT CTACACTCGT CAGCAGCTCA GGGAGCTTCT GGACGGCGCG GCAGGGGCGC AGGGAGATGT CGGGCACCTG GTGATTCTCG AACCCAATCC TGATGGTCAC TACCGACGTT CAGATTTCTT GAACAGCAAC TCTGCCAGAG TAGGTGCAGC TCCCAGTAGA GGCGATACCG AGCAAAACGA CCATCGTGCG GTCGTCTCGC TTGATGCTTG A
|
Protein sequence | MDLPLAAGPG AERRERPGAV RRDDWAEDHH DVDVQDETGR RLTKARLPEG TAGIARLHTL PSAGAVSRPL PGISPSSGSP RPAWRSPPPP ARRPVPRQLT SSTDVDASTC LYYEYVTRSV PEKTLEHWAS QYITYRYRSK AALWWPARGE DIQFGRLPSR PGKIVQIELK TTEVVRRGVH EVEVDLGQLW EYRRLPLGKQ PFYAFPRPGA DWPGDLGEAA AKTGRAVSEL GYRRGKELWF ANWMVVMTTE QVADVMSKEL ALHGSEKRGE RRPLVRFAGK SEPRWGLEAA DPEVIRWRDF WSILDRCGRD RWPQLIRLPA VFLDIRDGRE LSGGREVYTR QQLRELLDGA AGAQGDVGHL VILEPNPDGH YRRSDFLNSN SARVGAAPSR GDTEQNDHRA VVSLDA
|
| |