Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2478 |
Symbol | |
ID | 5670874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2954378 |
End bp | 2956135 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241395 |
Product | hypothetical protein |
Protein accession | YP_001506816 |
Protein GI | 158314308 |
COG category | [R] General function prediction only |
COG ID | [COG1568] Predicted methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.627901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTCA ACATGACCGG CGAAAAATCC TCGACGGACG AGTCCTTGGC CGATAAGGCC CCGATGGACA ACGCCGCCGC CTATGTCGCC GGCTACGGTG TGCGATCCCG CCCACTGCGC GAAATTCTCT CGCTGTTGAC CGGCGGCAGC CAGCCGATCG ACGTTCTGAT AACGCGGACC GCGACACCGC GGCGGGCGGT CGAGGAGCTT CTGCGTAGCC TTGGCCCGGA TCTCACCGAA AACAAGCACG GTTACCGCCT ACGGGCGGAA GTCATCGCCG AATATCGCGC CCGATTCGCC CTGGACGGCC TGGAGCCGGC CGGCGCCGCG GGCGATGACC GCGACGGCGA CGGGCTCGGC GCCCTTGCCC TGCTGATCAA GAACGCCCCC GCACCCCGCC GGGACCTCGA CCATGTGGCC GCGACAGCGG GAACGCTCGC CCGCCGCGCG ACCTGGCTCG ACACCACCTA CGACCTGGCC GGCCGGCGGG TGCTGTTCGT CGGCGACCAC GACCTGACCT CGGTGGCGCT GGCCCGGCGG CAGCCCGGCG CCGAGATCAC CGTGGTCGAC CTGGATGAGC GGACCCTCGC CTACATCGAC GCCACGGCAC GCTCCGAGGG ACTGTCGATC CGCACGCTGT TCGGCGACCT GCGATTCACC CTGCCGCCCG CCGCGCGGGA GTGGGCGGAT CTCGTCCTCA CCGATCCGCC GTACACCCCC GAAGGGGTCG GTCTTTTCCT CGGGAGAGCG CTCGCCGGCC TGGGTGACCG GAAAAACGGA GTCGTCGTCG TCGCCTACGG CCACAGCCGG CTCCATCCGA TGCTGGGTTT TCAGGTACAG CAGTCCATGC AGCAGTTCGG TGTCGTCTTC GAAGCCATAC TGCCGGCATT CAACCGTTAT GACGGCGCGC AGGCCGTGGG AAGTGCCAGC GATCTGTACG TGTGCGCGCC GACCAGCCGC ACGTGGAAAG TCCTCGAGCG GGCGGTGGAG AGCTTCGGAA CGCGCATCTA CACCCACGGG ACGCAGTCCG TGGAGAGCAC CGCGGCGGTC GAGCTCGGGC CGGCCGCGAC GGTGATCGGC GACGCCACCG CCGCCGGATC GCCGGGCGCC CGCCGCCTCG GCCTGCGCTC GCTGTTCACC GGCTCGGACT ACCTTCGCGA CCTCGGCGAC AACGCGGACG TCGCCGTGGA CCTCACCGCC GACCCCGGTC CGCTGCTGCT GCGAGCCCTG CTCGCCGTGA CCGCCCGGAA GGTCCGGTTC GTCGTGCCGG TCGACCATCC CGACGTGTCG ACCCCCGGCG CCCGGGCGGC TCTCGCGAGC CTGGTCGCCC CGAAGTACCG CCTGACGTTC CCCCCACCGG CCTCAGCGGG ACGTCCCCGA GGCGAGGCCG GGCACGAGGG CGGCGGATAC GGGGTCGTCC ACGCGGACCT GGTCGACGCG GAGGCGGACA CCGGCGGCGA GGCGGACGTC GACGTGGGCG CCGGCGGTCC GACCCGGCCG CCCGCCCCGG ACGTAACCGC GCGTTGGCTG CTCGAGCGGG CCCACGGCCG GATCGGGAAC GTGCTGCGCG AGGGCGTCAT CCGTGCGGCC GCCCGGGACG GCCGGGCGAT CTCCAAGAAC GACGCCCGCG CGCTCGTCCG AGCTCAGGTG GGCACAGCCG ACCTCGACAC GCTGGACCTG ACGGCGATCG AGACCCCCCG CGCACGCCTG GAGCGGGTGC TCCAGGCCGT CCGCGCGCCG GGGCCGGCTC GATCTTGA
|
Protein sequence | MNVNMTGEKS STDESLADKA PMDNAAAYVA GYGVRSRPLR EILSLLTGGS QPIDVLITRT ATPRRAVEEL LRSLGPDLTE NKHGYRLRAE VIAEYRARFA LDGLEPAGAA GDDRDGDGLG ALALLIKNAP APRRDLDHVA ATAGTLARRA TWLDTTYDLA GRRVLFVGDH DLTSVALARR QPGAEITVVD LDERTLAYID ATARSEGLSI RTLFGDLRFT LPPAAREWAD LVLTDPPYTP EGVGLFLGRA LAGLGDRKNG VVVVAYGHSR LHPMLGFQVQ QSMQQFGVVF EAILPAFNRY DGAQAVGSAS DLYVCAPTSR TWKVLERAVE SFGTRIYTHG TQSVESTAAV ELGPAATVIG DATAAGSPGA RRLGLRSLFT GSDYLRDLGD NADVAVDLTA DPGPLLLRAL LAVTARKVRF VVPVDHPDVS TPGARAALAS LVAPKYRLTF PPPASAGRPR GEAGHEGGGY GVVHADLVDA EADTGGEADV DVGAGGPTRP PAPDVTARWL LERAHGRIGN VLREGVIRAA ARDGRAISKN DARALVRAQV GTADLDTLDL TAIETPRARL ERVLQAVRAP GPARS
|
| |