Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2455 |
Symbol | |
ID | 5670851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2919945 |
End bp | 2922434 |
Gene Length | 2490 bp |
Protein Length | 829 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241372 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_001506793 |
Protein GI | 158314285 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0262289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.779517 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTGGTCGT TCTGGCAAGC TGCGGGCATG GATGTGGGCA CGGGGGCCCG GGGACCGGTC GGCGGGGAAT CAGCGTCCTC GGCGGGTTCG GGCGGTGGGG TGGACTTCTT CGTCTCCTAC ACCGGGGCGG ATGAGGCGTG GGCGACCTGG GTCGCCGAGG TGCTCGAAGC CGCGGGCCGC ACGGTGGCGG TGCAGGCGTG GGATTCCCCT GCGGGGGAGA ATTTCGTGAC CTGGATCAGC GTGCAGATGG GCGCTGCGGC CCGGACGGTT GCGGTCTGTT CGCAGGCGTA TTTCGCTTCG CATTGGTGCA CCCAGGAGTG GACCGGCGCC CTGGCCGGGC GGAAACTGAC CCCGTTGCGT GTCGCCGACT GTCCAGTCCC GCCGGTGCTG TCGACTATCT CCTACCGGGA CCTGTTCGAC GTCGACGAGC CCGTAGCGCG ACGGCGTCTG CTGGAGGCGG TCGGCCTGGT CCACCCCGTG CGGGTTTCCG GTGGCTTCCC CGGCCGGCCG GCCGCCCCGC CCTCGGTGGG GGCGGTGTTC CCGGGGCGGC TGCCCGCAGT GTGGAATGTG CCCGCGCGGA ACCTACTCTT CACCGGCCGT GACACGCTCC TGGACGGCCT GCGCACCCAG CTCGCCGCGG GTACTGGCCG TATCGCGATC GCGGCGTTGC GGGGCGCCGG TGGGGTGGGT AAGTCGCAGC TGGCGGTGGA GTTCGCCTGG CGGTATGCCG CGGACTACCA GCTGGTCTGG TGGGTGGACG CGGAAACCCC CGCGGGTCTG CTCGCCGGTC TTGCTGCTCT CGCGAACACT CTCGGGATCG GCTCGGGGGA TCTGCCGGTC CGGGCGGAGG AGGCGCTGGC GGAGCTGGGA CGTCGACAGA GCTGGCTGCT GGTCTATGAC AACGTCGGCG ACCCAACGAC GTTGGCGCGG ATGCTTCCGC CGGCCACGGG ACGGCTGGTG GTGACCTGCC GCGATCCGGG GGTGGGTCGG GTCGGGGTGG AGTTGGTCGA AGTCGGCGAG TTCACCCGCG CCGAATCGTT CGCGCTTCTG CGCCGCTATC TGCCTACCCT GTCCGACACC GCCGCCGACC AGCTCGCTGA CGCGCTCGGT GATCTACCGT TGGCTGTCGA CCAGGCCGGG GCTTTCCTGG CCACCAGTGG TATCCCCGTG CACGACTATC TGGCCCTGCT GGCCACCCGA CCCGCTTTGC TGCTCGCTGA GGAGACGCTG CATCATCCAG GGCTGGCCGC GACCGTCACC GCTGCCCGTG AACGGCTCGG CAGCGACCAT CCGGGTGCTG CTGGCCTGCT GGATCGGCTG GCGTTCCTCG CCCCCGAACC CATCCCACTC CGCCCCGCCG GCAGCGAATG GCTGGCCCTC GGGGATCCCT ACACCACGCA CACCGCGCTT GCTGCGATCA GCAGGCTCGC GCTCGCCCGC CGCACCGACA CCACCCTCCA GCTGCACCGC CTCGTCCACG CTCTCCTGCG GGCCCGCCTC ACCGTCGACC AGCGGCGCAC CGCCCTCGCT GGTGCCCTTG ACCTGCTCGC CACCGTCTAC CCCGGCGAGG CGGAGGACCC CTCGGCCTGG CCGACCTACG CCACGCTCAC TCCCCACGTC ACCGCGGTCG CAGGCCATCT CGCTGACTTT CCCGGCCTGG CCGAACCCGA CGGGTTCCGG CTGCTGCTGC ACCGGACCTG CTGGTACCTG CACTGCAGCG GCCAGAACCA CAGTGTCCGC ACCCTCAGCC ACACCACCCG CACCCGCTGG ATCACTACCC TCGGCGCTGA CCATCCCGAC AGCCAGATCA TCACCACCGC CCTCGCCACC GCGCTGGCCG ACCTCGGCGA GCACCGGGCG GCGCGGGAGC TGGCCGAAGA CACCCTGACC CGCCGCAGCC GGGTCCTCGG CGACGACCAC CCCCTCACCC TGCAATCGGC GAACAACCTC GCCAACCAGA TGGCCGCCCT CGGCGAGCAC CAAGCCGCCC GCAAGCTGGC CGAAGACACC CTGACCCGAA TGCGGCGTCT CCTCGGTCAC GACCACCCCG ACACCCTGGC CTCGGCGAAC AACCTCACCC TTCTGCTGGC CGACCTCGGC GAGCACCAGG CGGCGCGGGA GCTGGCCGAA GACACCCTGA CCCGCCGCAG CCGGGTCCTC GGCGACGACC ACCCCCACAC CCTGCGCTCG GCGAACAACC TCGCTTTCCA TCGGGCCGCG GTGGGGGAGC ACCAGGTGGC GCGGGAGCTG GCCGAGGACA CCCTGACCCG CTGCCGCCGG GTCCTCGGTG ACGACCATCC CGACACCCTG GCCTCAGCGC ACAACCTCGC CATCGATCTC AGCGCCTTGG GAGAACACCA GGCAGCCCGG GAACTGGCCG AAGAAACCCT CACCCGCCGC CGGCGCCTCC TCGGCGACGA CCACCCCCAC ACCCGCGACA GTGCCGGCCA GCTCGAGCTC CTCCTCGAAC AGCTCGGCCC GGCCCTCTGA
|
Protein sequence | MWSFWQAAGM DVGTGARGPV GGESASSAGS GGGVDFFVSY TGADEAWATW VAEVLEAAGR TVAVQAWDSP AGENFVTWIS VQMGAAARTV AVCSQAYFAS HWCTQEWTGA LAGRKLTPLR VADCPVPPVL STISYRDLFD VDEPVARRRL LEAVGLVHPV RVSGGFPGRP AAPPSVGAVF PGRLPAVWNV PARNLLFTGR DTLLDGLRTQ LAAGTGRIAI AALRGAGGVG KSQLAVEFAW RYAADYQLVW WVDAETPAGL LAGLAALANT LGIGSGDLPV RAEEALAELG RRQSWLLVYD NVGDPTTLAR MLPPATGRLV VTCRDPGVGR VGVELVEVGE FTRAESFALL RRYLPTLSDT AADQLADALG DLPLAVDQAG AFLATSGIPV HDYLALLATR PALLLAEETL HHPGLAATVT AARERLGSDH PGAAGLLDRL AFLAPEPIPL RPAGSEWLAL GDPYTTHTAL AAISRLALAR RTDTTLQLHR LVHALLRARL TVDQRRTALA GALDLLATVY PGEAEDPSAW PTYATLTPHV TAVAGHLADF PGLAEPDGFR LLLHRTCWYL HCSGQNHSVR TLSHTTRTRW ITTLGADHPD SQIITTALAT ALADLGEHRA ARELAEDTLT RRSRVLGDDH PLTLQSANNL ANQMAALGEH QAARKLAEDT LTRMRRLLGH DHPDTLASAN NLTLLLADLG EHQAARELAE DTLTRRSRVL GDDHPHTLRS ANNLAFHRAA VGEHQVAREL AEDTLTRCRR VLGDDHPDTL ASAHNLAIDL SALGEHQAAR ELAEETLTRR RRLLGDDHPH TRDSAGQLEL LLEQLGPAL
|
| |