Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0914 |
Symbol | |
ID | 5669328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1063956 |
End bp | 1065380 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239841 |
Product | hypothetical protein |
Protein accession | YP_001505276 |
Protein GI | 158312768 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.447806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0678288 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGAGG CCGCCGGTTA CCTGCTCGTG GCCGTCCTGC TGATCGCCGG TAACGCGCTC TTCGTCGCCG CCGAATTCGC GCTCGTCGCC GTCGAGCCGC ACCAGGTCGA GGAGGCGGCG AACACGGGCG ACCGCCGCGC CGGAATCGTC CTGCGGGCGG TGCGCTCGCT GTCGTTCCAG CTCTCGGGCG CGCAGCTCGG CATCACGGTG ACCTCGCTGG TCGTGGGCTA CATCGCCGAG CCCGCCGTCG CCACGCTGCT CGAACCGCTA CTGGACGTCG CCCACATCCC GCCGTCGCCG CGCGATGTCA CCGCGATCGT GCTGGGGCTG GTAGTGGCCA CCGTGACGCA GATGGTCTTC GGTGAGCTCG TCCCGAAGAA CTGGGCGATC TCCGAGCCGG TGCGGGTCGC CCGGGCGGTC GCGCCGGCAC AGGTCATGTT CTCGCGGGTG TTCCGGCCGC TGATCACCCT CCTGAACGGT TCGGCGAACG CCCTGCTGCG GGCGATGGGC GTGGAGCCCC AGGACGAGCT GCGCAGCGGC CGGTCGTCGG ACGAGCTGAG CTCGATCGTG GCGTCCTCGG CCGAGCACGG CACCCTGCCC GTCACCACCG CGGCCCTGCT GAGCAGGTCG CTGCGCTTCG GCGACCGGCG GGCGTCCGAC GTGATGACCC CGCGGGTGCG GACGGTCTTC GCCGGGGCGG GCACCTCGCT GGCGGAACTG CTGCGCCTCG CCGAGCACAC CGGGCACTCG CGCTTCCCCG TCCTGCGCGA GGACGACGAG ATCGGCGAGG ACGGGTACGG CGTGGTCGGC GTCGTCCACG TCAAGGACGC GTTCGGAGTC CCGGCGCCGG AGCGGGCGCG GCGGACGGTG CCGGAGATCA TGGTCGAGCC GCTGCTGGTG CCCGCGTCGC TGCACTGCGA GGTCCTGCTG CGCCGGCTGC GGCGCGGCGG CCTCCAGCTC GCCGTCGTCA TCGACGAGTA CGGCGGCACG GACGGCATCG TGACGATGGA GGACCTCGTC GAGGAACTCG TCGGCGACGT CGACGACGAG CACGACCGCC CGGCACCGCC CGACGCGGTG GCCCTCGGCG CCGGCCAGTG GATGTTGTCC GGGCTGCTCC GCCTCGACGA GGTAAGCGAA GCGACCGGGG CCCGCCTGCC CGCCGGCCCC TACGAGACCA TCGGCGGGCT CGTCCTGGCC CGCCTCGGCC GGCTGGGAAG GCCGCGTGAC GTCGTCCAGG TCGAGGGCCA TGAGCTCGTT GTGGCCTCGG TCGACGGCCA CCGGATCGAC CGGGTACGCC TCAGCCCCAC GGAAGCCACG GACGGGACAC CCGCCCGGGC GGACGCACCC GGGCACGCTG GGACGTCCGG GCACGCGGGC AGGTCCGGGC ACGCGGGCAC GTCCGAGCGG GCGGGCGACC GATGA
|
Protein sequence | MLEAAGYLLV AVLLIAGNAL FVAAEFALVA VEPHQVEEAA NTGDRRAGIV LRAVRSLSFQ LSGAQLGITV TSLVVGYIAE PAVATLLEPL LDVAHIPPSP RDVTAIVLGL VVATVTQMVF GELVPKNWAI SEPVRVARAV APAQVMFSRV FRPLITLLNG SANALLRAMG VEPQDELRSG RSSDELSSIV ASSAEHGTLP VTTAALLSRS LRFGDRRASD VMTPRVRTVF AGAGTSLAEL LRLAEHTGHS RFPVLREDDE IGEDGYGVVG VVHVKDAFGV PAPERARRTV PEIMVEPLLV PASLHCEVLL RRLRRGGLQL AVVIDEYGGT DGIVTMEDLV EELVGDVDDE HDRPAPPDAV ALGAGQWMLS GLLRLDEVSE ATGARLPAGP YETIGGLVLA RLGRLGRPRD VVQVEGHELV VASVDGHRID RVRLSPTEAT DGTPARADAP GHAGTSGHAG RSGHAGTSER AGDR
|
| |