Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1974 |
Symbol | |
ID | 5670375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2371048 |
End bp | 2372292 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240895 |
Product | hypothetical protein |
Protein accession | YP_001506317 |
Protein GI | 158313809 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.915201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.562335 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAGA ATCTGCCTGC GGCCGACCAG TGCACGCTCA GCCGGCGCGG AGTTCTCGCG GCGGCCGGCG CCGTCGGCAT CACGGCCGGC GTCGCCTCCG TGCTGCCGAT CCGCGTGGCC TTCGCCGACA CCCCCGGCCG AACCCTGGTC GTGATCTCGC TGCGCGGCGG GATGGACGGC CTGAACGCAC TGGTGCCGGG CGCGGACCCG AACTACGCGG GCCTGCGGGC CGACATCGCC ATCCCGACCG GCCAGCTCAT CCCTATGGAT CGGACGTTCG GCCTGCATCC CGCGTACGCG TCGCTGAAGC CGCTGCTGGA TGCCGGGAAG GTGGCGGCGG TTCCCGCCGC GGGGCTGCTC GACAACTCAC GCAGCCACTT TCAGGACACG TTCAACCTCG AGATCGGCGG TGCCGGCCAG AACAGCGGTT ATCTCGCCCG CCTGCTGAGC GTCCTGAGCC CGGGTTCGGC GTTCCGCGGG ATCCAGGAGG GCGGCTCGCT GCCGACCTCC TACCTCGGCT CCGCCGAGGC CCTGACTCTC AACGGCATCG ACAACTTCAA GGCGAACGGC TGGGGCCAGG GTGTCGAGGA GACCGCGACG GCCACCGCGC TCGGCGCGCT CTTCGCCGGC CCCGACGCCG GCCGCCCCTA CGCCCGCGGC GTCGCGACGA CCCTGGCCGC GCTAGCCGAG GGCAAGCAGA TCGCGGCCAA GCCCTACACC CCGGCCGACG GCGTGACCTA CCCCGACGAG GGCCTGGGCC GAGCGCTACG GGACGTCGCC CGCCTGATCA AATCCGGGAC CGGGCTCCGG GTCGCGGCGA TGGAGACCGG GGGCTACGAC ACCCATGTCG GCCAGGGCGG GGTGACCGGC AGCCTGGCGA CGCTGCAGAA GCGGCAGGGC GACTCGATCG CGGCGTTCTT CGCCGATCTC GGCCCGCAGG CCACCGACGT CACCCTGGTC ACCATCCAGG AGTTCGGCCG GCGCGCGGAC ACCAATGGAA ACGGCGGGAC CGACCACGGC GGCGGCGGCG TCATGTTCGT GATCGGCGGC GGCGCGGTCG GCGGGGTGCA CGGCAAGTGG ACGGGCCTCG CGCCCGACCA GCTGGACGAC GGCGCGGTGC CCGTCCTGAA CGACTACCGC AACGTCCTGG GCGACGTGAT CCGCTGGCTC GGGATGTCCC AGGAGCAGCT CGGGACGGTC TTCCCCGGGC TGACCCACGC CCCGGTCGGG GTCACCTCGG CCTGA
|
Protein sequence | MVKNLPAADQ CTLSRRGVLA AAGAVGITAG VASVLPIRVA FADTPGRTLV VISLRGGMDG LNALVPGADP NYAGLRADIA IPTGQLIPMD RTFGLHPAYA SLKPLLDAGK VAAVPAAGLL DNSRSHFQDT FNLEIGGAGQ NSGYLARLLS VLSPGSAFRG IQEGGSLPTS YLGSAEALTL NGIDNFKANG WGQGVEETAT ATALGALFAG PDAGRPYARG VATTLAALAE GKQIAAKPYT PADGVTYPDE GLGRALRDVA RLIKSGTGLR VAAMETGGYD THVGQGGVTG SLATLQKRQG DSIAAFFADL GPQATDVTLV TIQEFGRRAD TNGNGGTDHG GGGVMFVIGG GAVGGVHGKW TGLAPDQLDD GAVPVLNDYR NVLGDVIRWL GMSQEQLGTV FPGLTHAPVG VTSA
|
| |