Gene Information |
Locus tag | Franean1_6314 |
Symbol | |
ID | 5674633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7667317 |
End bp | 7668648 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245167 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001510562 |
Protein GI | 158318054 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
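The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only and makes two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 7667317
end_bp = 7668648
gene_length_bp = 1332
protein_length_aa = 443

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose final codon is a stop codon
# that does not contribute an amino acid.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(span % 3 == 0)                                 # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the reported gene length (1332 bp) and protein length (443 aa) agree with the start and end positions.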
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGAC GCCGCCCGCT ACCCACCGCA CCGACCGGCC TGTGGGACCG CCCCGAGATG GCCCAGGCCC TCACCGCACG CGACATGAAG ACCGTGCTGG AGATCTACCG GAAGTGGACC GGCGCTTCCC AGTCGCAGAT AGCCGCCATG ACCGGCATCG CGCAGCCATC CATCAGCGCG ATTCTTCGCG AGCAACGCCA GGTCACCAAC ATCGAAAGCT TCGAGAAGTT CGCCGACGGA CTCGGCATCC CCCGCGAACG TCTCGGACTC GCCGCCCCGA AGACCACAGC TCCGGACACC GCCGACAGCG CGACGAGTCC GGATCGGCGC GCCGTGCTCG CCGCCGGAGC ACTCTTCGCG ATCGACGCCG AGTTGGACGA GGTCAGCCGC CGGATGCAGC AGGTCGCCGC ATCCAACGTC GACGATGACG CGCTGCACCA GCTCGACATC AGCATCGAAG TCGTGGGCCG CCGCTACGAG AACAGCGACG CCGCCACCGT CTACCCCGTC GCGCTGAAGC AGCGCCGGTG GGTCGCCGAC CTCATGAGCG GACACCAGCA CCCCGACCAG CGCCGGGAGC TGTACGCCAT CGGCGGGAAG CTCTCCGGCC TGCTCGGCTA TCTCGCGTTC GACCTCGGGA ACGAACTGGT CGCCCGCGCC TACTGCAACG AGGCCATGAG CCTGGCCAAG ACCGCCGGAC ACCGCGACCT CGCCGCGTGG GTCCGCGGCA CGCAGAGCTT CATCGCCTAC TACGGCGGCC GGTACCGCGA AGCCCTGGAC CTCGCCCGCG ACGGACAGCG CTACGCCCGC GGCGGCCCCG CCAGCATCCG ACTCGCGATC AGCGGCGAAG CCCGCACACT CGGGAAGCTC GGCGACATCG CCGGAGTCGA CGAGGCCGTC GGGCGCGCTC TGGCCGCCCA CGCCCGCATC GAGGACACCG ACCCCGTCGG CTACTTCCTG TCGTTCGACC CGTTCACCGC GTCCCGCATC GCCGGCAACG CCGCCTCCGC CTACCTCGCC GCCGGCGCCC CCGACCGCGC CCGCGAGTTC ACCGACCAGG CCATCCCCAT CTTCGCCGCC GCCGGGTCCA CCGCCAGCCA CGCCCTGACC CTGGTCGACG CGAGCATGAC CTACCTCACC GGCCCCGACC CGCAGCCCGA CCGCGCCGGA ACTCTCGTTG CCGAAGCACT GGACGTCGGG GCAGACCTTC GATCCGAAGT GGTCGCCCGC CGGGCCCGAG ACTTCCTGCT CACCGCCGCC CAGTGGCGCA CCGTCCCCGA GATCGCCCAG GTCAACGACG CCGTCAAAGC CTGGAGACTG CCCACCAGCT GA
|
Protein sequence | MTRRRPLPTA PTGLWDRPEM AQALTARDMK TVLEIYRKWT GASQSQIAAM TGIAQPSISA ILREQRQVTN IESFEKFADG LGIPRERLGL AAPKTTAPDT ADSATSPDRR AVLAAGALFA IDAELDEVSR RMQQVAASNV DDDALHQLDI SIEVVGRRYE NSDAATVYPV ALKQRRWVAD LMSGHQHPDQ RRELYAIGGK LSGLLGYLAF DLGNELVARA YCNEAMSLAK TAGHRDLAAW VRGTQSFIAY YGGRYREALD LARDGQRYAR GGPASIRLAI SGEARTLGKL GDIAGVDEAV GRALAAHARI EDTDPVGYFL SFDPFTASRI AGNAASAYLA AGAPDRAREF TDQAIPIFAA AGSTASHALT LVDASMTYLT GPDPQPDRAG TLVAEALDVG ADLRSEVVAR RARDFLLTAA QWRTVPEIAQ VNDAVKAWRL PTS
|
| |
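The relationship between the gene sequence and the protein sequence above can be illustrated by translating the first and last few codons by hand. This is a minimal sketch, not the method used to produce the record: `translate`, `head`, and `tail` are hypothetical names, and the code relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons). The `tail` fragment is assumed to be in frame, which holds if the full sequence is 1332 bp as listed.

```python
# Codon table in the conventional TCAG ordering (amino-acid assignments
# shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two groups of the gene sequence listed above.
head = "ATGACCCGAC GCCGCCCGCT ACCCACCGCA"
tail = "CCCACCAGCT GA"

print(translate(head))  # MTRRRPLPTA
print(translate(tail))  # PTS
```

Both fragments match the corresponding ends of the listed protein sequence, which begins MTRRRPLPTA and ends PTS.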