Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1985 |
Symbol | |
ID | 5670386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2385741 |
End bp | 2386640 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240906 |
Product | protein of unknown function zinc metallopeptidase putative |
Protein accession | YP_001506328 |
Protein GI | 158313820 |
COG category | [R] General function prediction only |
COG ID | [COG2321] Predicted metalloprotease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.892752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.526462 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTCG ACGATTCGTC GGTCGACACG TCCCAGCTCG AGGACAGACG CGGCGGCGGC CTGGGCCGCG GTGCCGTCAT CGGCGGCGGT GGTGCCGGCC TGGTCGGCCT GCTCATCTAC CTGGTGGTGG CCGTGCTCGG CGGCGGCGAC GGGACGGGCG CGAACGGAAC GGGCATCGAC GGGTCGGGGG ATGCGCAGCC GGGCACCGAA CAGGGCACGG CGGGCAGCGA CATCGCCACC CGTTGCAACA CCGCCGGCGC CCTGGACCAG TACGACGACT GCTTCGTCCT CAAGGTCTTC AACGAGGTGA ACGAGGTCTG GACGGCCCAG TTCGCCCGGG CCGGCCAGCA GTACTCCGAC CCGCGGCTGG TCTACTTCTC CGACGCCGTC TCGACCGGAT GCGGAACGGC GTCCGCGCAG GTTGGGCCCT TCTACTGCCC ACCCGACCGA CGGGTCTTCA TCGATCTCGG GTTCCTCGAT CAGCTCCAGC GGGACTTCGG GGCGCAGGGC CGGTACGCGC AGGCCTACAT CGTCGCTCAC GAGGTCGGGC ACCACATCCA GACGATCACC GGAACCGAGC AGCGGCTGCG CGAGGCGCAG AACGCCGACC CGTCCCGCCG CAACGCGCTG TCGGTGCAGC TCGAACTGCA GGCCGACTGC TACGCCGGGG TGTGGAGCAC GCTGGCGAAC TCCGCCGGCA ACGTCACGAT CAACGAAGCT GAGCTCGACG AGGCGCTCCG GGCGGCCGAG GCCGTGGGCG ACGACCGGAT CCAGGCCTCG GCCGGCGGCC GCGTCGACCC GGAGTCGTGG ACCCACGGCT CGGCGGAGCA GCGACGTGAG GCCTTCCTGA ACGGATACCG GGGCGCCTCG ATGGCGGCGT GCGGCACCGT CCCGGCCTGA
|
Protein sequence | MRFDDSSVDT SQLEDRRGGG LGRGAVIGGG GAGLVGLLIY LVVAVLGGGD GTGANGTGID GSGDAQPGTE QGTAGSDIAT RCNTAGALDQ YDDCFVLKVF NEVNEVWTAQ FARAGQQYSD PRLVYFSDAV STGCGTASAQ VGPFYCPPDR RVFIDLGFLD QLQRDFGAQG RYAQAYIVAH EVGHHIQTIT GTEQRLREAQ NADPSRRNAL SVQLELQADC YAGVWSTLAN SAGNVTINEA ELDEALRAAE AVGDDRIQAS AGGRVDPESW THGSAEQRRE AFLNGYRGAS MAACGTVPA
|
| |