Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1736 |
Symbol | |
ID | 5670138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2074983 |
End bp | 2075981 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641240654 |
Product | hypothetical protein |
Protein accession | YP_001506080 |
Protein GI | 158313572 |
COG category | [S] Function unknown |
COG ID | [COG4278] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0750549 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000433226 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGATCT CACTCGGTTA CCTGCTCACG GTGCTTGCCG CCGTCGCCGC CGTGGTGGCG CTGCGCGCGA GGGCGGTGCG GGGCCGCGCC ACCGGGGAGC CCGATCCGCG GGCGGTTCCG GCCGAGGACC TGGCGCTGGT CGCCGCCGGG CCGAAGCGGG CCGCCCTGAC CGCTCTCGTG GGCCTCTACG AGGCCGGCGC GATCTGCATC CCGCTGACCT GGCGGCTCGG CGTGTCGGGT CCGCTCCCGA CGGGGAGCAC GCCGCTCGCC GCGAGCGCGC ACCGGCTGAT CGCGGCCGCC GGCCAGCCGA GCGTCCGCGG AATTCTCCAG CCGCTCGCCA ACGGTCCCGA GCTGCGGGCC GCGCAAGCGC GCCTGGCCGC CGCCGGTTAC GTCCCGTCCG CCGGCCTGGT CGGCCTGCTG AACCTCGTGC GGCTGTTGGT GCCGCTGGCC ATCGCCGTCG CCGCTCTCGC GCTGCTCGCC GACGGCGCGG GCGCGCCGGG ACTCGTCGCG TCGGTGGCGC TCATCGGCGC GGTGGCCGTG ATCCTCCTGG CCGGGCCGGC GCCGGCGACG CGGGACGGCC GCCGGCTCGT CGAGACCGCC AGGGCCCGCT TCGCCGGATA CGCGGACGGG GGCGTACCCG GCCAGCGTGC CGTCGCGGTC GCGCTGTTCG GGCCGGCGGC CCTGTGGCGG GCCGACGCCA CCGTCGCGAA CCGGCTGGGG GTCGGACCCC GGCCCCAGGA CGTCCGCATC GGTGGCCGGG GCGACTCGGC GCGCGGGCAC ACCGGCCCGA CACACTCGGG CGACAGCGCG TTCGGTGGAA TCGCAGGCTG GTTCGCCTTC GGCGGGGACG ACTCGCACCA CGGCGGCGGG GACGGTTGGA GCAGCGGGCA CAGCTGGGGC GGTTGGGGCG GCGGGCACGG CGGTGGATGC GGAGGCGGTG GGCATGGCGG AGGTGGCGGC GGTGGAGGTT GCGGCGGTGG AGGTTGCGGC GGTGGCTGA
|
Protein sequence | MAISLGYLLT VLAAVAAVVA LRARAVRGRA TGEPDPRAVP AEDLALVAAG PKRAALTALV GLYEAGAICI PLTWRLGVSG PLPTGSTPLA ASAHRLIAAA GQPSVRGILQ PLANGPELRA AQARLAAAGY VPSAGLVGLL NLVRLLVPLA IAVAALALLA DGAGAPGLVA SVALIGAVAV ILLAGPAPAT RDGRRLVETA RARFAGYADG GVPGQRAVAV ALFGPAALWR ADATVANRLG VGPRPQDVRI GGRGDSARGH TGPTHSGDSA FGGIAGWFAF GGDDSHHGGG DGWSSGHSWG GWGGGHGGGC GGGGHGGGGG GGGCGGGGCG GG
|
| |