Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1582 |
Symbol | |
ID | 5669985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1889912 |
End bp | 1890811 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641240501 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001505927 |
Protein GI | 158313419 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00696223 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGGTGCTG AGCGCCGGGA CCTGCACACC CGGCCGCTGA CCACCCCGGG TACCAACGAC CCGGCGCCGG ATCAGGCTGC GCCCGTTCAC ACGGCGACCG GTCGCACGGC GCCGGGTGAC ACGGCGCCGG GCGCCGCCTC CCGTGGCGGC GCCCCCGCCC GGTTGTCCTC GCTGGGTGAC CTGCTCGCCG CGCTCCGCGG GAGACCGCGG ACCGGCGGGT ACGCCGCCGG CGCAGACCTG ACCGGCGCGG ACCTGGCCGG GGTGTGCCTC ACCGGGCGGA TCCTGCGCGG GGCGCAGCTG CACGGTGCCT ACCTCAGCGG CGCCGACCTG CGCGGGACGG ACCTCCGGGA CGCCTGCCTG CGCGGGGCCG ACCTGCGGGA CGCCGACCTC AGCCAGGCCG CGCTCGGCGG TGCGGACCTC GCCGGCGCGC TGCTCGCCGG CGCCTTCCTC ACCGGCGCCG ACCTGCACGG GACGGACCTA CACGGAGCCT TCCTCCACAA CGCGGATCTC CGGAAGGCCT TTCTCGCCCG CGCCGACCTG CGCGGAGCCG ACGCCGACGG GATCATCATG CGCGGCGCGG ACCTGCGCGC GGCCGACGCC ACCGACGCGG TCCTGCGTCA GGCGGACCTG CGTGCGGCCG ACCTGCGCGG GATCCGCCTG GCCGGGGCGA TCCTGCGCGG GGTCGACCTG CGCGGGGCGG ACCTGCGCGC CGCGGACCTG GGCACCGCCC TCCTGAGCTC CGCCCGGCTC GACCGGGTGT ACTGGTCGAC CGCGACGACC TGGCCGCCCG GTGAGTGGAC CTACCGGATG CGGTCGGCGT CCCAGCAGGT CGCCCCGGGC GTCTTCCAGG TCTCCGACGA GCCGACGCGC CCCGGCCGCC ATCGCCCGAC GCCCGACTGA
|
Protein sequence | MGAERRDLHT RPLTTPGTND PAPDQAAPVH TATGRTAPGD TAPGAASRGG APARLSSLGD LLAALRGRPR TGGYAAGADL TGADLAGVCL TGRILRGAQL HGAYLSGADL RGTDLRDACL RGADLRDADL SQAALGGADL AGALLAGAFL TGADLHGTDL HGAFLHNADL RKAFLARADL RGADADGIIM RGADLRAADA TDAVLRQADL RAADLRGIRL AGAILRGVDL RGADLRAADL GTALLSSARL DRVYWSTATT WPPGEWTYRM RSASQQVAPG VFQVSDEPTR PGRHRPTPD
|
| |