Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3815 |
Symbol | |
ID | 5672179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4531225 |
End bp | 4533678 |
Gene Length | 2454 bp |
Protein Length | 817 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242694 |
Product | hypothetical protein |
Protein accession | YP_001508114 |
Protein GI | 158315606 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.888105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.726246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGTCGG AGCCATCGGG TGCGAGACAA TCGCGATCCG GGGAACAACG CACCCGGGCG GCGGACCCGC TGCGCACCAA GCCGGCGCGC TCGCCTCACC CCAGCGCGGA CCGGCCCGGC AACCTGCTGG CCCTGCAGAC GCTCGCCGGC AACGCGGCGG TCACCGACCT GATCGAAGCG GCCGGACCGC CCGGTGTGGC GCGCTGGACC GGACCGATCA GCTTTCAGTC CCAGGCCTCG TTACTGGACG AGGCGCGCAA GGGCAGCTAC AACGCGGTCA TTCAGCTGGA CGAGGCGACC TTGGCCGGCG CGGACGACAA CGATCGCCTC AAGTGGATTG ATCAGGTCAA CGACAGCACT CTGGTCGTCC TGCGCGCCTC CCGGGCGCTG GAACGGATCT GGCGAAGCTT CGGCAGCCGT TTTCTCGAGG TCGCCGGGGC GAACCCGGAC AACCTGGCCC GCTGGCGGCG CAGCTGCGCG CGACACACCG GGCTGCCCGA GCAGGTGCCG CAGGCCGCGG ACCTGCAGGG CGCCTGGTTG CGGGACATTC GCACGATCGG CGGCGGCTGT CTGGACACCA ATGAGGAGTT CGCCAGGAAC AAGCTCCAGC AGTTCGGCGC GTCCGAGTCG GGCGACACCA TGGCGGCGCC AACCGACGAG CAGGCAAACG CGCTGAGCCA ACTGCAGACG GCCGCCGAGG GGCTGGCCGC GCTGCGTTGG GGACAGGAGA CCGCGCGGCA GATGTACGTC GGGTATGTCG ACTACCTGCC GCCGGCCGGT TCGATGCAGG ACGCCCACCA CTACCGCCGG GTCCGGTTCG ACCCGACCGC GGCGCCGCCG CTGACGAGGA TCGAACGTGA CCGGGAGCGG CCGGCCTACC TCTATCGCGA GCTCACCGAG ACCGAGGAGC TGGTCTCGGA CGATGCCGAC CCGAATGCCC TCGCCCCCGT CCAGTCCTAC GAGGATGCGC GCCGGAAATA CGACGAGGCG GAAGCCGCCG CGAGCGTGAC CCTGTCGATC TATCCGGAGC TGTTCGCGTT CTCCGGCAGC CAGTCCGATG CCGGGCTGGG CCAGTTCGCG GTCGCCCAAA GCAGCTCCGC GGCCCGGCAA CAGTTGGTGA CCGGGCTACG GACCATGCTG AGTCACATCC GCGCCACCAG ACAGCAGCTG GGGCCCGGCG GTGGCCTCGA CCCGCTGGAC CTCACCCCCA TCCACCGGCG ACTGCTCCGC GGCGAGATCA CGGCGTCTTC GGGCACCGAC TGGACCCGGC CCTTCGCGCG CGAGGTGGCG GGAAACCTCG TCCAGGGCCA CAACGTCGAC ATCGCCCTGC ACCGGCTGGG GCTCCAGCTG ATTGCCGAGG CCGCGTTCCT CTTCGCCCCG GCGACGGGTG GGCTGACCGC CGTCGCCGCG CTGACCCTCG CCACCGGTGC CTCCGCCGGG AACGTCGCTC TGGACGCCAG CCGGTACGCC GCCCTCGCCG ACGCGGCGGC ATCGGCGGCC CGCCCGGGTA CAGCACTGGT CGACCGCCGG ACCGTCGACG ACGCCCGGAT GGCCACCGAG TCGGAGGCGA TCGCGCTCGC CCTCGCAGCC CTCGCCCTGG GCGCCGCCGC GGCCGCCGGT GCCCTGCGCG CCTGGCGTGC CCGGCAGACG CCGCCACCAG AACAGCCGCC CCCCGCCCAG CCGCCGAGCG GGGGGCGGCC GGCCCAACAG GGCAACGCAC CCCAGCAGGG CGGACAACAA GGCGCGCCCC AGCCGGGGAC GCCACAAGCG GGCGCGACTG AGCAGGGCGC GCCGCAGCAG GGGAACGCGC CCCAGCAGCC GCCGGACCCC GCTGCGGCCG TGGTGGCCCA GGCACAGGCG CAGACACAGG CCGCCCGCGC CGCCTTCGCA GCCGAGATCG GAATCGACGC GGGGACGCTG GCCGGCTTCA CCGAGGAGGA GATCAACCGG CTGCGTCAGC TGCTTCCCAA CAGGCACCCC AGCCGAATAG CGGGGCTGCG CAACTACCTC AGCGAGCAGG TGAGCCGGGG CCGACACACC AGGAACATCC TGCGCACCCT GGAGGAGATG GAGCCACGGG AGCGGGCGCG CTACCTCGAC CGGCGAGCCG CCATCCGATG GAATCCCGAC TGGCGTGGCC GTGACCCGGC GCCCCGGCTG GAAGTCGGCA ACGCGGACGA GGGGTGGACG CACATCGATG CGCGGCATGT CACCGGCAAC GCTGCCGGCG GAGCCGGTGA CCTGTTCGCG CCGGGGACGA CACGGCAACA GATCTTCGAG GCGGCGGTCG AGGTCATCGA GCGCGGAAAC CGCGTCTCGG CCCGCGGCCA GCGGATCACG ACGTTCGAAC GGTCGTTGTA CGTCAACGGG CGGCGGGACG CGATCCGGGT GACGGTGGAC ACCTCGGACG GTCGTATTAT CACCGTCTTT CCCGTCCGCG GAGGTGGGCC GTGA
|
Protein sequence | MLSEPSGARQ SRSGEQRTRA ADPLRTKPAR SPHPSADRPG NLLALQTLAG NAAVTDLIEA AGPPGVARWT GPISFQSQAS LLDEARKGSY NAVIQLDEAT LAGADDNDRL KWIDQVNDST LVVLRASRAL ERIWRSFGSR FLEVAGANPD NLARWRRSCA RHTGLPEQVP QAADLQGAWL RDIRTIGGGC LDTNEEFARN KLQQFGASES GDTMAAPTDE QANALSQLQT AAEGLAALRW GQETARQMYV GYVDYLPPAG SMQDAHHYRR VRFDPTAAPP LTRIERDRER PAYLYRELTE TEELVSDDAD PNALAPVQSY EDARRKYDEA EAAASVTLSI YPELFAFSGS QSDAGLGQFA VAQSSSAARQ QLVTGLRTML SHIRATRQQL GPGGGLDPLD LTPIHRRLLR GEITASSGTD WTRPFAREVA GNLVQGHNVD IALHRLGLQL IAEAAFLFAP ATGGLTAVAA LTLATGASAG NVALDASRYA ALADAAASAA RPGTALVDRR TVDDARMATE SEAIALALAA LALGAAAAAG ALRAWRARQT PPPEQPPPAQ PPSGGRPAQQ GNAPQQGGQQ GAPQPGTPQA GATEQGAPQQ GNAPQQPPDP AAAVVAQAQA QTQAARAAFA AEIGIDAGTL AGFTEEEINR LRQLLPNRHP SRIAGLRNYL SEQVSRGRHT RNILRTLEEM EPRERARYLD RRAAIRWNPD WRGRDPAPRL EVGNADEGWT HIDARHVTGN AAGGAGDLFA PGTTRQQIFE AAVEVIERGN RVSARGQRIT TFERSLYVNG RRDAIRVTVD TSDGRIITVF PVRGGGP
|
| |