Gene Information |
Locus tag | Franean1_1611 |
Symbol | |
ID | 5670014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1928250 |
End bp | 1929587 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240530 |
Product | hypothetical protein |
Protein accession | YP_001505956 |
Protein GI | 158313448 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
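The gene and protein lengths in the table follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon, which is consistent with the values listed here. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Franean1_1611).
length = gene_length_bp(1928250, 1929587)
print(length)                     # 1338
print(protein_length_aa(length))  # 445
```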
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00173364 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCTCA GCCGCCGCCG CTTCCTCATC GCCTCCGGCG TCACCGGAGC CGGGGCCCTC GCCGCGGGCG CGGGGGTGCT CAGCTGGGAC GCCCTGCACG GCGCCGCCGC CGACCGCCCG CTGGCCGCGG GCACCGGCGT GCTGGTCGTC GTCACCCTCT ACGGCGGCAA CGACGGCCTG AACACCGTCA TCCCCTTCCA GGACCCGGCC TACCGGCAGG CACGCGGCGA CCTCGCGCCT GCCGAGCCGG ACGTCCACCC GCTGGCCGAC GGCCTCGCGC TCGCCCCGGC GCTCCCCGGG CTCGCCGGGC TGTGGAAGGA GGGCCGCCTC GCGGTCGTGC GGGGCGTCGG CTACCCGGAG CCGGATCACA GCCACTTCCG GTCGATGGCC ATCTGGCAGA CGGCGTCACC GCGGACGGCG TCGACGTCCG GCTGGATCGG GCGCTGGCTC GACGCGCTGG CTGCCAAGAA CGGCGCGACA GGCGGCGCGA CGGGCGGTCC GGCAGGCGGT TCGACGGCCA GCCCGGCGGG TGGGCAGGCC GGCGGCGCCG ATCCGCTGCG GGCGCTGGCG ATCGGCCCCA CGCTGCCGCC GCTGCTGGTC GGCGACCACA CGGTGGGCTC GGCGGTACCC ACCGGCGGCT TCGCCGCGCC CAGCGGTCGG CTGGGAGCCG ACCTGCACGC GCTCTACCGC CCCGACCCGG CCGACAGCCC GCTGGCCGCC CGGGCCGCCG CATCCGGCGC GGACCTGTTC ACCGTCGCCG GGGCGCTCGC CCCGGTGCTC GCCGACGCCC CGGAACGCGA CGCCGGCACC CAGGGCCAGC TCGACGGCGG GGATCCTGGG GCAGGCGGTG GCGATGGCGG CGAGCTCGGC GCCCAGCTCG ACGTGGTCGC CCGCGCCATA ACCGCCGGGG TGCCCACCCG GGTGTACTCG GTGAGCCTGG GCGGCTTCGA CACCCATGCC GCTGAAGACG GCACCCACAC CCGCCTGCTC GGCCAGCTGG ATGCCGCGCT GACCCGCTTC CACCGCTCCA TGGCCGCCAC GCCCCGCGGG AGCGGGGTGA CGACGATGGT CTACTCGGAG TTCGGCCGGC GGGTCGCCGC GAACGCCAGC GGCGGCACCG ACCACGGCAC CGCCGGTCCC GTCCTGCTCC TGGGGCGGCC GGTGCGCGGC GGCTTCTTCG GCGACCAGCC CCCCCTCACC GACCTCGACG ACGGCGACCT GCGGGTCACC ACGGACTTCC GCTCGGTGTA CGCGACGCTG CTGGAGCGCG TCCTGGGAAC GGAGGCGGGC ACCGTGCTCG GGTCCGACGA GAGCTTCCCC CGCCTCGCGT TTCTCTGA
|
Protein sequence | MPLSRRRFLI ASGVTGAGAL AAGAGVLSWD ALHGAAADRP LAAGTGVLVV VTLYGGNDGL NTVIPFQDPA YRQARGDLAP AEPDVHPLAD GLALAPALPG LAGLWKEGRL AVVRGVGYPE PDHSHFRSMA IWQTASPRTA STSGWIGRWL DALAAKNGAT GGATGGPAGG STASPAGGQA GGADPLRALA IGPTLPPLLV GDHTVGSAVP TGGFAAPSGR LGADLHALYR PDPADSPLAA RAAASGADLF TVAGALAPVL ADAPERDAGT QGQLDGGDPG AGGGDGGELG AQLDVVARAI TAGVPTRVYS VSLGGFDTHA AEDGTHTRLL GQLDAALTRF HRSMAATPRG SGVTTMVYSE FGRRVAANAS GGTDHGTAGP VLLLGRPVRG GFFGDQPPLT DLDDGDLRVT TDFRSVYATL LERVLGTEAG TVLGSDESFP RLAFL
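The GC content reported in the table can be recomputed from the gene sequence. The minimal sketch below assumes the sequence is pasted as shown above, in space-separated blocks of ten bases. The function name is illustrative, and the example uses only the first 20 bases of the gene rather than the full sequence.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (such as the 10-base block separators used above) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence, for illustration only.
example = "ATGCCGCTCA GCCGCCGCCG"
print(f"{gc_content(example):.0%}")  # 80%
```

Passing the full gene sequence to the same function gives the value to compare with the GC content field.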