Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6156 |
Symbol | |
ID | 5674477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7489894 |
End bp | 7491201 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641245008 |
Product | hypothetical protein |
Protein accession | YP_001510406 |
Protein GI | 158317898 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.665559 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACC GCAGCGACGG CGACCGCGAT GACAGCGGAC GCCACGGCAC CGGACCTGGT GGGCCCGCCC ACCACCATGA CGGGGACGGA CCTTCCGGCG GGCGGGACGA ACTCGGCGAC GACTGGGGCC GGGTGGATGC CGACGGCACC GTGTACCTGC GCACCGCCGA CGGTGAGCGC GCGGTCGGCT CGTGGCGCGC GGGCAGCCCT GAGGAGGGCC TGGCCCACTT CCGCCGTCGC TACGACGACC TCCTCGCCGA GGTGGTGTTG TTGGAGCGGC GGCTGACGGT GAGCGGCGTC GACCCCGGCG GCATCGCCGG CAGCGCCCGC CGGCTGCGGG AGGGGCTGGC CCAGGCCTCG GTCGTCGGCG ACGTCGACGC CCTGGCCGCG CGGCTCGACG CCGTCCTGGC GGCCACGGAC ACCCGCCGCA CGGAGCTCGC CGCCGAGCGG GCCCGCCGGG TCGCCGCCGC GGTCACCGCC AAGGAGGAGC TGGTCACCGA GGCCGAGCAG CTCGCCCGCA GCTCCGAGTG GAAGGTGACG AGCGAGCGTT TCCGGACCAT CGGCGACGAT TTCCGCGCGA TCACCGGTGT CGACAAGCGG ACCGACTCGG CGCTGTGGCG GCGGATCGCC GCGGCCCGCG ACGAGTTCAC CCGCCGCCGC ACCTCGCACT TCGCCGCGCT CGACACCCAG CGCACGCGCT CACGCGAGCG CAAGGAGGCG ATCATCGCCG AGGCCGTCGC GCTGGCGGAC TCGACGGACT GGGGCCCGAC GACCGCGCGG TACCGCGCGC TGATGGTCGA GTGGAAGGCG GCCGGCCGGG CCGCCAAGGA CGTCGACGAC GAGCTGTGGG CCCGGTTCCG GGCCGCGCAG GACGGCTTCT TCAGCCGGCG CAACGCCGTG AACGCCGAGC GCGACGCGGA GCAGATCGCC AACCAGGCCC GCAAGGAAGA GCTGCTCGTC GAGGCCGCCG CGCTCGATCC CGTCGACGTC GAGCGGTCGC TGCGCCGGTA CCGCGAGATC CAGGAGCGCT GGGACGCGAT CGGCCGGGTG CCCCGCGAGG CGGTCGGCAG CCTGGAACGC CAGCTCAACG CCATCGGGGA CAAGCTGCGC GATGCCTCCG ACGCCCGTTG GGACCGTCGT GACATCGCCG AGTCCCCGTT CCTGACGAAG CTGCGCGAGT CGGTGGCGAA GCTCGAGGCG AAGCTGGAGC GCGCCCGCGC CGCCGGCCGG GCCCGCGAGA TCACCGAGAC CGAGAACGCC CTCACGACCC AGCGCGCCTG GCTGGCGCAG GCCGAGAAGG GAAGCTGA
|
Protein sequence | MSDRSDGDRD DSGRHGTGPG GPAHHHDGDG PSGGRDELGD DWGRVDADGT VYLRTADGER AVGSWRAGSP EEGLAHFRRR YDDLLAEVVL LERRLTVSGV DPGGIAGSAR RLREGLAQAS VVGDVDALAA RLDAVLAATD TRRTELAAER ARRVAAAVTA KEELVTEAEQ LARSSEWKVT SERFRTIGDD FRAITGVDKR TDSALWRRIA AARDEFTRRR TSHFAALDTQ RTRSRERKEA IIAEAVALAD STDWGPTTAR YRALMVEWKA AGRAAKDVDD ELWARFRAAQ DGFFSRRNAV NAERDAEQIA NQARKEELLV EAAALDPVDV ERSLRRYREI QERWDAIGRV PREAVGSLER QLNAIGDKLR DASDARWDRR DIAESPFLTK LRESVAKLEA KLERARAAGR AREITETENA LTTQRAWLAQ AEKGS
|
| |