Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3803 |
Symbol | |
ID | 5672167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4512279 |
End bp | 4513421 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242682 |
Product | hypothetical protein |
Protein accession | YP_001508102 |
Protein GI | 158315594 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.544146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.13322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCACGA AGTCCGGGAT CGCCGTGGCG GTCGCGACGG TCGTGCTCGT CGTGGCCGGC TCCGTGCTGG ACTATCCGGA GCTGCTCGCG CTCGGCTTGG CGGCCGGTGT CGCCCTGCTG TTCGCGGCCG GCTGGATGCT GGTCACCCCG GATGTCACCC TCTCCCGGGA GATCCACCCG CCGCGGGTCT TCGAGGGGGA CGGCGCCCGC GCCCTGATCG CGGTGACGAA TGCGGCCCGG CGGCGCAGCC CGCCCATCCT CGCCGCCGAG TCGGTGGGCG ATCGCACGGT CGCGGTGGCC CTGCCGAGCC TGGCACCGGG CAGCCGTTTC TCGGCAACCT ACCCGCTGCC GACCGACCGG CGCGGCGTCT TCGAGGTCGG GCCGCTGGTC GTGGGCCACA GCGACCCGCT GCGGCTGTTG CACGTGGGGC GGGCTTTCCC GTCCCGGTCG ATGCTGCGGG TGCACCCGCG GATCCATCCG GTGGGCCCCC TGCCGACCGG TGGTTCGCCC GATATGGACG GCCCGACCAG CGCGACCGCG CCGCAGGGCG GGGTGGCGTT CCACAGCCTG CGCGAGTACG TGCGTGGCGA CGACCTGCGG CTGATCCACT GGCGGTCGAC CGCGCGCAGC GGACGGATGA TGGTGCGCCA CAACGTGGTG CCGAACGAAC CCCGGATGAT GGTCGTGTTG GACACCAGTG AGTCGCCGTA CCAGGGCGAC TACTTCGAGG ACGCGGTCCG GGTCGCCGCA TCGCTGGCGG TGTCCGGCTG CCAGCGCGGC TTCCCGGTCG AGTTGCACAC CACCGGTGGG ACGCGGGTGG TCGCCGAGAG CGGACAGGAC ACCACCAGCG TCCTCGATGC CCTCGCCGGC GTCCGTCCCG GACCGGACGA CCCCGGGCTC ACGGCGCTGC TGCGCATGGT TCCGCGCGAG GAGGGCGCCG CGCTCGGCGT GGTGACGGGG CAGCCGCCGG GGGCGAAGAT CTCCGTCATC TCGGCGGTTC GAGCCCGGTT CGCGATGGCG AGCCTGGTCT GCGTCGGGGA GGAGCACGGT CGTCCCGGGC CTCCCGTCCG CGGGGCGCTG GTGGTGAACG TCCGCACCAG CACGGACTTC GCGTCCGTGT GGAACGCGTC GGTGCGCCGA TGA
|
Protein sequence | MITKSGIAVA VATVVLVVAG SVLDYPELLA LGLAAGVALL FAAGWMLVTP DVTLSREIHP PRVFEGDGAR ALIAVTNAAR RRSPPILAAE SVGDRTVAVA LPSLAPGSRF SATYPLPTDR RGVFEVGPLV VGHSDPLRLL HVGRAFPSRS MLRVHPRIHP VGPLPTGGSP DMDGPTSATA PQGGVAFHSL REYVRGDDLR LIHWRSTARS GRMMVRHNVV PNEPRMMVVL DTSESPYQGD YFEDAVRVAA SLAVSGCQRG FPVELHTTGG TRVVAESGQD TTSVLDALAG VRPGPDDPGL TALLRMVPRE EGAALGVVTG QPPGAKISVI SAVRARFAMA SLVCVGEEHG RPGPPVRGAL VVNVRTSTDF ASVWNASVRR
|
| |