Gene Information |
Locus tag | Franean1_5725 |
Symbol | |
ID | 5674051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6952702 |
End bp | 6954231 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641244578 |
Product | hypothetical protein |
Protein accession | YP_001509981 |
Protein GI | 158317473 |
COG category | |
COG ID | |
TIGRFAM ID | |
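The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 6952702, 6954231
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the reported Gene Length (1530 bp) and Protein Length (509 aa).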
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.229196 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCAGGG TCGGTCCCAT ACGGAAGACG ACCTCCCGGG GTTTTCCCGG GGTACCCGAG CACGGGTACC TGCATCGGCT GACGCCGGCC GGCATGCCTG AAGCCTGGAG CGCGCGGCCG CTGTACTTCT TTTTCAGCTA TGCCAGGGCG GACAGCCGTG ACGACCCGCT GCTCGAGCGG TTCTTCCGTG ATCTCAGAGC CGAGGTGCGC CGCCGGGTGG GTCATCCGGA CCCGGACACC GTTGGCTTCC TTGACCAGCG CAGCATCCAG CCGGGGGAGG CCTGGTCGAC GGAGCTTGGC GCGGCACTCT GCCGGTGCAG AACCTTCGTG GCGATGTGCT CGCCCTCGTT CTTCGCGAGT GAGCACTGCG GCCGTGAATG GGGGATGTTC GACGAGCGGC TGCGCTCCGC CGCCGCATCG GCCCAGGCCT GTCCGGCGGC GCTGCTTCCC GTGATCTGGA CGCCGTTACG CGATCCGCCG GAGCTGCTCG CCCGGTTGCA GTATGATCAC AGCGGGTTGG GGAAGACGTA CACGCAGTTC GGGTTGCGGT ATCTCATGCA GCTCAAGCGC AATCATGACG AGTACCAGGA GTTCGTGCTG ACATTAGCGG TTCGGATCGT CCAACTCGCT GAGGACTCGC CGTTGGCCCC GAAGTCCGAG ATCCCGCCAC TCCAAGAGGT TCCCGACGCA TTCCGCGTAC TCGCCAGGCC ACCGTTGACA CGTGATCAGG AAGATGTGCC GCCTGAGATG CGGGCGACGG TCGTTCCCGA CCCGCCGCGA GCCGACCGTG CCGAGGACGA CGTCCCCGTG GTGGACAAGG GGCAGGGCAA GCCGCCGTCA CCGGATCGGC AGACGGCGCC CGCGCCTGAT CCGCCCCAGT CCTCGACTCC GCCCCGCCCG CTCCAGGAGG CGCCGTCGCC CATCGTGGGC GGGCCGGGCC GCGTGACTTT CGTGTTCGCC TCGACGTCGG CCGAGGAGAT CGCCACGTTG CGCCGTCAGC TCGACTGTTA CGGCGAGACC TTCGAGGAGT GGACGCCGTA CAAGCCTCGT GCCCACGACC GGGTCTGCGT GATCGCCCAG ATGGTTGCCG CCAGGCAGGG TCTTGTCTCG AACATCGTGC CACTGGACCA CGGCGTCTCC GATCTGCTGG AGCAGTCAAA GGTCCGCAAC GAGATCGTGA TCCTCGTTGT GGACATGTGG GCGGCCAAAC TGGAGAACGT GCGTCGGGCC CTGGCCACCT ACGACGTTCG CAACGAGCCG ACGTCGGGCG TCCTGGTGCC GGTGAACCCG CACGACCTGG AGACCCATGC CAACTCCGCG CATCTCACCG AGGTACTGGC GACCGCGCTG CGGAACAACT TCGTCCGCCG GGACAAGCTG TTCCGCATGG ATGTCCACTC CTGCGAGGAG TTCGACCTGG CCCTGGTCCA GATAATCGTC GAGTCGCAAG CGCGGATCTT CGAGTACCGG CGGGCCCTGC GGCCGACGGG GGACAACCCC CGTCGGTTTC CCCGGCTTTC CGGCCCCTAG
|
Protein sequence | MPRVGPIRKT TSRGFPGVPE HGYLHRLTPA GMPEAWSARP LYFFFSYARA DSRDDPLLER FFRDLRAEVR RRVGHPDPDT VGFLDQRSIQ PGEAWSTELG AALCRCRTFV AMCSPSFFAS EHCGREWGMF DERLRSAAAS AQACPAALLP VIWTPLRDPP ELLARLQYDH SGLGKTYTQF GLRYLMQLKR NHDEYQEFVL TLAVRIVQLA EDSPLAPKSE IPPLQEVPDA FRVLARPPLT RDQEDVPPEM RATVVPDPPR ADRAEDDVPV VDKGQGKPPS PDRQTAPAPD PPQSSTPPRP LQEAPSPIVG GPGRVTFVFA STSAEEIATL RRQLDCYGET FEEWTPYKPR AHDRVCVIAQ MVAARQGLVS NIVPLDHGVS DLLEQSKVRN EIVILVVDMW AAKLENVRRA LATYDVRNEP TSGVLVPVNP HDLETHANSA HLTEVLATAL RNNFVRRDKL FRMDVHSCEE FDLALVQIIV ESQARIFEYR RALRPTGDNP RRFPRLSGP
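The sequences above are printed in whitespace-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, the codon table is the standard amino-acid assignment (which translation table 11 shares), and the first codon is assumed to be a valid initiation codon and is therefore written as M (TTG is one of the alternative start codons listed for table 11).

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiation codon, so it is emitted as M.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("TTGCCCAGGG TCGGTCCCAT ACGGAAGACG"))
```

The first 30 bases translate to MPRVGPIRKT, which matches the first block of the protein sequence in this record. Applying `gc_fraction` to the full gene sequence would be the way to compare against the reported GC content.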