Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0010 |
Symbol | |
ID | 5668437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 15821 |
End bp | 17065 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641238939 |
Product | hypothetical protein |
Protein accession | YP_001504385 |
Protein GI | 158311877 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.225832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGCCAG GTGAGTCGGC GACGGGTGCT TCCCTCCTCT CCCCCGAGGC GCTGCGCGCC ACCGCGGACG CGATAGCCGC GGCCCAGGAG CCCGACGGGG CGATCCCGTG GTTCGCGGGC GGCCACACCG ACCCGTGGGA CCACCTGGAA TGCGCGATGG CGCTGCTGGT CACCGGCCGG GTCGACGCAG CCGACGCCGC GTGGGACTGG CTGCACCGCC GCCAGCGCCC GGACGGCTCC TGGGCCACCA GCCACGTCGG CGGAGCAGTC AAGGAGGACT TCGCCGACAG CAACCAGTGC GCCTACGTGG CGACGGCCCT GTGGCACCGC TGGCTCGTCA CCGGCGACCG CGCTTTCGTG ACCCGGATGT GGCCGGTCGC CCGGGCTGCC CTGGACTTCG TCGTGGACAT GCAGGCCCCC GGCGGCCAGA TCTGGTGGGC CCGCACCCCG ACCGGCGAGG ACTACCCCGA GGCACTCGTC ACCGGATGCT CGTCCACCCT GCACAGCCTG CGGTGCGGCC TCGGCCTCGC CGCGCTGGTC GGCGAGGCGC GGCCGGAATG GGAGGTCGCC GCCGGCGCGC TGTGGCACGC GCTGCGCCGC CATCCCGAGT ACTTCATGCC GCGGGATCGC TGGTCGATGG ACTGGTACTA CCCGGTGATC GGCGGCGCGC TGCGCGGCGC GGAGGGGCGG GCCCGGCTGC GCTCCCGGTG GGACGAGTTC GTCGTGCCCG GTCTCGGTAT CCGCTGCGTG GACGACGAAC CATGGGTCAC CGGCGCCGAG ACCTGCGAAC TCGCCATCGC GCTGCACCTG GTGGGGGAGA CCGAGGCGGC GGCCGGCCTC GTCCGGGAGA TGCAGCACCT GCGGGCCCCC AACGGGGCCT ACTGGACCGG CTGGCAGTTC GCCGACGGAT GCCACTGGCC GGAGGAGCAG TCAACCTGGA CGGCGGCCGC CGTCGTACTG GCCGTCGACG CCCTGGCCGG CGGCCCCACC GAACGCACCT TCCGCGGGGA CGACCTGCCC GAGGGCCTGC ACGTCGTGCA CCGCGACGAC CTGCCCGAGC CACGCGACCC GTCGACGCCC CGCACCGGAC GAGGCACGGC CGGCGAACGG GCGCCGTGCG GCGAGCGGCG GGCCGGCGGC CTCGCCACCC CCGGTCGGGG CGGCGAGGCG CTCACCGGTC CGTGGCGCGC GTGCGGGTGT GACTCAGAAG AGCCGGCGGA GGACCCGCAG CGATCCCGTG CGTGA
|
Protein sequence | MRPGESATGA SLLSPEALRA TADAIAAAQE PDGAIPWFAG GHTDPWDHLE CAMALLVTGR VDAADAAWDW LHRRQRPDGS WATSHVGGAV KEDFADSNQC AYVATALWHR WLVTGDRAFV TRMWPVARAA LDFVVDMQAP GGQIWWARTP TGEDYPEALV TGCSSTLHSL RCGLGLAALV GEARPEWEVA AGALWHALRR HPEYFMPRDR WSMDWYYPVI GGALRGAEGR ARLRSRWDEF VVPGLGIRCV DDEPWVTGAE TCELAIALHL VGETEAAAGL VREMQHLRAP NGAYWTGWQF ADGCHWPEEQ STWTAAAVVL AVDALAGGPT ERTFRGDDLP EGLHVVHRDD LPEPRDPSTP RTGRGTAGER APCGERRAGG LATPGRGGEA LTGPWRACGC DSEEPAEDPQ RSRA
|
| |