Gene Information |
Locus tag | Franean1_5728 |
Symbol | |
ID | 5674054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6958064 |
End bp | 6960613 |
Gene Length | 2550 bp |
Protein Length | 849 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641244581 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_001509984 |
Protein GI | 158317476 |
COG category | |
COG ID | |
TIGRFAM ID | |
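The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 6958064
end_bp = 6960613
gene_length_bp = 2550
protein_length_aa = 849


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)      # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)  # Gene Length agrees with Protein Length
```

Under these assumptions, 6960613 - 6958064 + 1 = 2550 bp, and 2550 / 3 - 1 = 849 aa, which matches the listed values.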
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.885214 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGTGGAGT TCGACTCGGC GAGCGGGCCG AGTCGAGGCG TTCCGATGAT TTGGGGTGGT GTACCCCAGC GGAACAAGAA CTTCACCGGG CGAGAAAGCC AGCTCGCCGA GCTCCGTCGC CGTGTCTCCT CCGAGCAGAC CGACGTCACC GCCGTCCTGC CGCACGCCCT CCAGGGCCTG GGTGGTGTCG GGAAAACTCA CCTCGCCATC GAGTACGCGT ACCGGTACCA GTCCCACTAC GACCTGATCT GGTGGATCCC CGCCGACCAG CCGGCGCTCG TCCGCTCCAA CCTGGCCGCG CTGGCGCCTC GTCTCGGCCT GTCCGAGGGC GGCCTGCTGC GGGTCGAGGA CTCGGTTGCC GCCGTTCTCG ACGCCCTGCG CCGGGGCGAG CCCTACCACC GATGGTTGTT GATCTTCGAC AACGCCGACC AGCCCGAGCT GATCCGCAGC CTGATGCCCC ACGGCCCCGG CCATGTACTC GTAACTTCGC GTAACCGGCG ATGGCAGAGC ATCGTCGACA CGATCGAGGT CGACGTCTTC GCTCGCCGCG AGAGCCTGGA GTTCCTCGAT CGTCGGGTTC CCGGCATCGC GGAGTTCGAC GCCAACCGGC TGGCCGAGGC GCTCGGGGAC CTGCCGCTGG CGCTGGAGCA GGCCGGGGCG CTGCAGTTCG AAACCGGGAT GGGCGTCGAG GAGTACCTGG ACCTGCTCGG GGAGGCGTCG AGTCGGCTGC TGGCGGAGAA CCCGCCGTCC GACTACTCGA AGCCGGTGGC GGCCGCGTGG AGCCTGTCGG TGACCCGGCT GCGCGATCAG GCGCCCTTCG CGCTCGAACT GCTGCGGCGG TGTGCCTTCT TCGGCCCGGA GCCGATCTCG CTGGAGATGC TCGACCGCGG CAAGTACACC CTGAGCTCCG ACTTCGGCCG GTCGATGCAG GACCGGTTGC TGGTGAGCCG CGCCATGCGC GAACTCGGCC GCTATGCGCT AGCGAAGATC GACACCAGTC GCAAGACGGT GCAGGTGCAC CGCCTGGTCC AGATGCTCAT CCGCGAAGAG CTCCCGGCCG AGGAGCAGGC CCGCATGCGG GACGAAGTCC ACGCACTGCT GGTGGCCGCC GACCCGGGGA ACTCCGAGAC CCAGAACCGG GTCGAGTTCG AGAACCTGCT CCCACACATC GTGCCCTCCG GTGTCTTCGA GTCCACCGAG GTCCCGGCCC GTCAGCTCAT CGAGCACATC ATCGGGTACC TGTACAACGT CGGAGACTTC ACCACGGGCC TGAACGAGGC GGACCGCGCA TTGCGGCAGT GGGAGAAGGC CTCCGGCGAG CGCGATCCGG ACGTGCTCGT GCTGAAGGGG ATCAAGGCGA ACATCCTCTG GTCGCTAGGT AGATCTCAGG ACGCATATGA CCTGCGACGG CCCACCCTGG AGGCGATCAC TGAGGTCCTC GGGCCGGACC ACGAGGAGAC TCTGATCATC CTCAACGGCC ATGGCGCCGA CCTGCGTTCC CGCGGCGAGT TCCGGGCGGC GCTCGTGCTC GACGAGGACG CCCTTCCCCG ACACGAAAGA GTTTTCGGTA CATATACTCA CGAAACCCTT CTTTGTGTCA ACAATCTCGC ATTGGACTAC AGCCTCAACA GCTCCTATGA GCTTGCGCTG CGGCGGGATC AGGAGAACCT CGCCAACAGG CGGGACCTGT CGGGCGGTGA TACGGATCCT TGGGTTGCGC TCTCGTTGGC TGCTGTCGCC CGTGACCTGC GGCAGGCGGG CCATTACCTG AAAGCCCGGG ATGCGGCGGA ACACGCCTAC 
CGGGTTTATG AAGATCTCGT CGACCGTGGG CGACTTACCG CCGATCATCC GTACGTTCTC GCGCAGGCGA AGGATCTCTC GGTCGCCCGG CGGAAGGCCG GGCTCTTCCC CGAGGCCCTG GTGCTCGCCG AGCAGGTCTA CGGTCGTTAC ACGGAGAGTC GGCAGTTCGA AAAAGAACAT CCCGAAGCCC TTGCCGCCGC GATCAATTTG GGGAATGCGC AGCGGGTCGC GGGTGATCCG AATGACGCGG CGGAGCGTAT CGAGAAGACC GTAAATCGGT ATCGTGAAGC TCTTGGTGCC GACCATCCGT ATACCTACGG TTGTTATCTG AACCTAGCGC TGGTCCATCG TCAGCTTGGC CGGGTCGACG AGGCCGAGCG CCTGCTCAAG GACGCGCTCG CGGGTCTGGA AGGCAGGCTC GGCCCGGACC ACCACTTCAC GTTGACCTGC CGGGCGAACC TGGCCACCGC TCGATCCGCG CAGGGTGCGG TCAGCGAGGC CCTGGAGACC GGGGAACAGA CCCTGAACTC GTTCCGTGAC CTGCTGGGTC CTGATCACCC GCACACCCTG GTCTGCGCGA CCAATGTCGC GTTGGACCTG GCCCAGCTCG GCCGGAACGA CGAGGCCAAG AACCTCTCCG CCGACACCGT CACCCGTTAC CGCCGGGTTC TGGGCCCGGA GCATCCCGAC GTGCGGGCCG GTGAGCGCGG CGAGCGGCTC GACTTCGACT TCGAGCCGCC GCCGCTGTAG
|
Protein sequence | MVEFDSASGP SRGVPMIWGG VPQRNKNFTG RESQLAELRR RVSSEQTDVT AVLPHALQGL GGVGKTHLAI EYAYRYQSHY DLIWWIPADQ PALVRSNLAA LAPRLGLSEG GLLRVEDSVA AVLDALRRGE PYHRWLLIFD NADQPELIRS LMPHGPGHVL VTSRNRRWQS IVDTIEVDVF ARRESLEFLD RRVPGIAEFD ANRLAEALGD LPLALEQAGA LQFETGMGVE EYLDLLGEAS SRLLAENPPS DYSKPVAAAW SLSVTRLRDQ APFALELLRR CAFFGPEPIS LEMLDRGKYT LSSDFGRSMQ DRLLVSRAMR ELGRYALAKI DTSRKTVQVH RLVQMLIREE LPAEEQARMR DEVHALLVAA DPGNSETQNR VEFENLLPHI VPSGVFESTE VPARQLIEHI IGYLYNVGDF TTGLNEADRA LRQWEKASGE RDPDVLVLKG IKANILWSLG RSQDAYDLRR PTLEAITEVL GPDHEETLII LNGHGADLRS RGEFRAALVL DEDALPRHER VFGTYTHETL LCVNNLALDY SLNSSYELAL RRDQENLANR RDLSGGDTDP WVALSLAAVA RDLRQAGHYL KARDAAEHAY RVYEDLVDRG RLTADHPYVL AQAKDLSVAR RKAGLFPEAL VLAEQVYGRY TESRQFEKEH PEALAAAINL GNAQRVAGDP NDAAERIEKT VNRYREALGA DHPYTYGCYL NLALVHRQLG RVDEAERLLK DALAGLEGRL GPDHHFTLTC RANLATARSA QGAVSEALET GEQTLNSFRD LLGPDHPHTL VCATNVALDL AQLGRNDEAK NLSADTVTRY RRVLGPEHPD VRAGERGERL DFDFEPPPL
|
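The relationship between the two sequences above can be illustrated by translating the first codons of the gene sequence. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it starts with GTG even though the strand is listed as `-`), and that under translation table 11 an initiator codon such as GTG is read as methionine. The function name `translate` and the short start-codon set are hypothetical choices made for this illustration.

```python
from itertools import product

# Amino acid assignments for the 64 codons in TCAG order.
# Translation table 11 uses the same assignments as the standard code;
# it differs in which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiator codons commonly listed for table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding-strand DNA string, reading the first codon as Met
    when it is an accepted initiator, and stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and first 10 residues of the protein.
gene_prefix = "GTGGTGGAGT TCGACTCGGC GAGCGGGCCG"
protein_prefix = "MVEFDSASGP"

print(translate(gene_prefix) == protein_prefix)
```

The first GTG is read as M because it is the initiator, while the second GTG is read as V, which is why the protein begins MVEFD rather than VVEFD.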