Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1970 |
Symbol | |
ID | 5670371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2367046 |
End bp | 2368635 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240891 |
Product | hypothetical protein |
Protein accession | YP_001506313 |
Protein GI | 158313805 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.971424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.857591 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGCTG ACCAGAAGAC CGCGACGTCC GTCCCGGCTG ATCTGTGGTC TCGCCCGACC GGCGAGACGC GGGTCGTCGA GACGGGGAGC GCGGTCCTCT GCCTGCACGG TGATCGTGTC TACAAACGGA AGAAACCCGT TGAGCCGGGG CTCCTCGACC TGCGGAGCCG GGCGGCCCGG CTGGCGGCCT GCCGCGCCGA GGTGGAGCTG AACCGGTGGC TCGCTTCCGA CGTGTACCTG GGGGTCGCCG ACGTCCTCGG CGACGGCGGC GAGGTGTGCG ACCATGCCGT TGTCCTGCGG CGAATGCCCA CCGGCCGCCG GCTCTCCGCG CTGGTGCGCC ACGCGGACCG CGTCGACGAC CAGCTCCGCG CCGTCGCGCG GACGGTCGCC GCGTTCCACG AGCGGTGCGG GACGTCCGAG GTGATCGGCC GTTCCGGCGA CGCCGAGGCT GTCGCGGGGC AGTGGAAGGA GACGCTCGGC GGCCTCGAGC CCTTCCAGGG GAAGGTCATC GACGCCGACG TCGTCGACGA GATCGGGCGG CTGGCGCTGC GGTACCTGGC CGGCCGCGGC CCGCTGCTGG CCGAGCGCCG GCGCGCCGGG CGGATCCGCG ACGGCCACGG CGACCTGCGG GCCGAGAACA TCCACTGCCT CGACGACGGC CCGCGCATCC TGAACCGGGT CGAGTCCGAC CCGCGGCTGC GGGCCGGGGA CGTCCTCGGC GACGTCGCCG TTCTGGTGAT GGACCTGGAG CGGCTCGGCT CGCCCGAGGA CGCCGAGCGC CTGATGCGCT GGTACCGCGA CTTCTCCGCG CAGGCCCATC CGCCTTCGCT CGAGCACTTC TACATCGCCT ACCGGGCCTT CACCGAGGCC CGGGTGACCT GCCTGCGGTA CCGGCGGATC CTGGCCGAGG CGGGGGCGGA GGCCGGCCCG GGCCCGGGCG CCGAGGCCGG CGAACGGGCC CGGCGGCTCG CCGACATCGC CTACCGGCAC CTGCGCCGGG CCCGGGTACG GCTGGTCCTG GTCGGCGGCC TGCCTGGCAC CGGGAAATCG ACCCTCGCCC GGCGGCTCGC CGACGCGGAC GACGGCCGCC TGCTGCTGCG CTCCGACGCC GTGCGGGCGG AGCTCGCCGC CGACGGCCAC GCCGATCCGG ACACCCCCGG TAGCGGGCCT GCCATCCCGG ACCGGCCCGC CGTGCCCGCG GACCTCGGCG CGTCCTTCAT CTGGCCGTTG TCCTCGGAGA TCACCGCGCG GACCTACACG GTGCTGCTGT CCCGTGCCCG CCGGGCACTG GAACGCGGCG AGACGGTGAT CATCGACGCC TCCTGGTCGG ATGGCCGCCA CCGCGCGGCG GCCGCGCGGC TGGCGCGCGA GACGGCCGCG GAGTTCCTCG AGCTGCGCTG CGTGACCTCA CCGGAGGTCG CGGCCACGCG GCTGACCCGC CGGGACTCCG CCAGCGACCC AGCTGGCGCC ACGTCCGCGG TACACCGCGC GATGAGCTCG TGGGCGGAGC CGTGGCCGAC CGCCAGGGTG ATCCAGACGA CCGTGCCGGT CGCCGAGGTG TTCCACGCCG CCGAGCGCTG CATCGCCTGA
|
Protein sequence | MPADQKTATS VPADLWSRPT GETRVVETGS AVLCLHGDRV YKRKKPVEPG LLDLRSRAAR LAACRAEVEL NRWLASDVYL GVADVLGDGG EVCDHAVVLR RMPTGRRLSA LVRHADRVDD QLRAVARTVA AFHERCGTSE VIGRSGDAEA VAGQWKETLG GLEPFQGKVI DADVVDEIGR LALRYLAGRG PLLAERRRAG RIRDGHGDLR AENIHCLDDG PRILNRVESD PRLRAGDVLG DVAVLVMDLE RLGSPEDAER LMRWYRDFSA QAHPPSLEHF YIAYRAFTEA RVTCLRYRRI LAEAGAEAGP GPGAEAGERA RRLADIAYRH LRRARVRLVL VGGLPGTGKS TLARRLADAD DGRLLLRSDA VRAELAADGH ADPDTPGSGP AIPDRPAVPA DLGASFIWPL SSEITARTYT VLLSRARRAL ERGETVIIDA SWSDGRHRAA AARLARETAA EFLELRCVTS PEVAATRLTR RDSASDPAGA TSAVHRAMSS WAEPWPTARV IQTTVPVAEV FHAAERCIA
|
| |