Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7293 |
Symbol | |
ID | 5675594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8909911 |
End bp | 8913198 |
Gene Length | 3288 bp |
Protein Length | 1095 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641246130 |
Product | hypothetical protein |
Protein accession | YP_001511518 |
Protein GI | 158319010 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3846] Type IV secretory pathway, TrbL components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.35344 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGATG TGCCACAGGC ACCGTACGGC GAGGAAGGGG ACGCGCGGCC TGTTTCGTTG ATAGGTGCGC GAATCCCCTC ATCTACGCCG TCCGCGGCCG AGCTCCCGGC GCGCGATGGC GAGAGCGACG ACCGCTCGGC GGCCAGGGCG GCCCGCCGCA GGAACCGCCG GCGCGGCGGT CCTGACGATG CCTCCGCCAC CGACGATGGT GACGGAACGT GGCAGGACCA AGCGTTCCAT CTAGAGGATG CGGTGTCCGG CGACCCGCCC CCCGCAGGCC GGTCCGGCAT GTCGGGATCA GGCCGGACGG GCATGACCCG TGGCACCACC CCGGAGCCGC GCGAGCCCGG GCAGCGCCGG GGGCGGCGCC TGCCGCGCAC GCCGCCGGCA GCCGGGCCCG TCCCGCCGAG GTCGTCTGTT CCACCCGGCA CGCCCGTTCC GCCCGGCGAC GCTGCCCGCG GCGGCCGCGC TGTGCCGCCC CGTTCGCCGC GCCCCGAATC GCCCAGGGCC GCTGACCAGG CCCGGCGGTT CGACGCGGGC GCCGCGGCGG CGGTCGACTC GCCCGACGGC TCCCTCGACG GCTCCGCCCC GCCGCGCCGG CCGATGCTGC GCCGGGCCGC CACACCCCGC GCCGTCCCGC CGCCGGCGCC GATGGGCCGC AAGGATCCGA GCTTCCCGCC CGCGACCTCG GCCGCGGCGG CAGCGACCGC CGCGGCCGCC GCCGGCACGG GCACCACGGG CACCACGGCC GCTCCGGGCA CCCCCGGCGG CGCCGGAGCC GCCGGGAGCC CGGGCTCGTC AGCCGGCTCG GGTGGGTACG CGGGCTCGAG CACGCCCGGT AGTACGGCCC GGCCTGGTTC CGGCGGGTCA GGTGCCGGCG GGCCCGGTTC CGGTGCTTCG GGTTCCGGTG CTTCGGGTTC CGGTAGCTCG AGTGCCGGTG GTTCGGGCGC GATGCGGTCC GGTGGCGCGC GGTCCGGTGG AACCGGCCGG GCCGCCGGCT CGGGTGGGTC CGCAGCCTCG CGTTCTCAGC GTCCCGAGGC GCCGGAGGCG CCGGTCCGGC GGTCCGGCGG ATCGGGCGGC ACCAGCGCGC CCCCGCCCAG CCGCCGGCCG ACGTCGGCCG GCCCGCGCCG TCCCGGGCGC CCGTCCGCTC TCGCGCCCAG GACCCCCGCC GCCAGCCCCC CGCCGGCGGG GCCGGAGACG GGATCGACCG CATGGCCCGC CCCGGCCACG GGATCGCCAC AGGCGGACCA CCCGGCGGCG GGTTCACCAC CGGCTGATCC CGCCTCCGGC GGCTCGTTCG GCGAGCCCAC GCCGGGCATG GGGCCAGGGC CGTTCGACGC GCAAGGGCCG TACGACCGGT CCGAGCCGCT CGCGCCCGGC GAGGCAGCTG GCTCGCCGGC CCCGGCGTGG CCCTCGCAGG GCTACGAGCA GGCCGACGTC CCGCTGGTGG ATCCGTCGCT CGGTGGTCTC CCCGTGGAGC TTGACGGTCT CCCCGTGGAG TTCGGTGGTC TCCCGATGGA AAGCCGCTCG CCGGGCGGCG ACCAGCCGGG CAGCGGGTCC TACTACGACC CCTGGGCCCC CCATCCGGCG CCGGGCCCGG CGCCGGATGC CGGGAACGAC GAGTCAACCG GTGCTCCCGG GTGGGGCGCC GTCGACGCCT GGCAGGAGGC CGGCGAACCG GTCAGCCTCG GGGAGCCCGC CCAGCCGGGC GAGGCGGCCC CGCCCGGCGA TCCCGCCGGC GGCAGCGGCC TGGCCGAGCG CACCCGGCGG GGCGCGGCGG CCGGCGAGAC CGAGACCGAG ACCGAGACCG ACGATTCGGC CGCGGCCGGA AGCTCGTCAA CAGGCGGAGC GCCGCCCGCC GACGAGGCCG CGTCCACGGA CGAGGCCTGG GCTGGTCACG GCGTCGCGGA ACCCGAGCGG ACGTCCGACG TGCTCCCGCG GCCCTCGACC GAGCTGGTTC CCGCCGGTCA CCTCCCGAGG CCGGGCACCG GCGCCGACCG CGCGCCGGCA CTGACCGACC AGGGCCCAGG CCGGGGCCTG TGGCCCTGGC AGGAGCCCTG GGCCGACCTC GACGACCTCG AGCCATGGGA GGAGATCCCG CTACGGATCA CTCTCTCGGA CCGGGCCGCG GTGGCCCTGC CGGGCTCGTG GCGGATCGCC GTGACCTCCC TGTTCCCCTA CTCCGGCACC ACCACGTTGG CCGGGGTCGT CGGACTTACC CTGGCCGGTG TGCGTGCCGA GCCGGTGCTC GCGATCGATC TCTACCCCGG GGCCGCCACG CCCGGTTTCG TGCCGCCCAC CTCGTCGGAG AGCGCCGAGA TCGACGAGTC GCGGGGCGAC AACCTGGTCG CGCGGGTCGG CAGCCGAGGC ACCACCACGG TTGCGGACAT CGCCCGCCAA CGCCGCTCCA CCGCCAAGGC GACGCCCGAC GAGCTGCGCT CGCTCGTCGG CGCGCGGCGG TCCGGCAGCG TCTTCGACCT TGACGTGCTG CCGGTCGACC GCCCGGTCGG CGACGAGGAG ACCACGAACG CGCTGGTGCC GGTCGACGAG CCGATCACCC CGGCCGTGCT GCGCTCCGCC CTGGGCGCGC TCGGTCACGC GTACCCGCTG ATCCTGATGG ACGCCCCCGC GACCGCACCG CTGACCCCGG AAGCGATCCG CGCCGCCGAT GTGATCCTCC TCGTGACCCT CGCGACCGCG TCCGATCTAG AAGCCACCCT CGCTGACCTG CGCGATCCCC AGGGCTCACT CGCGGAGGTC GGCGTGAGCG CGCGGGAGCG CATCCCGTCA GGCCAGGGCC CTGGCCACCC GCCGCACGAG CGGACGGGTC CCGCGGTGAT CGCGGCCGTC GTGTCGCCCC GCCGCGGCCG CCCGTCACCA CGCACACGCA CCGCCACGGC GCGGCTGGCC CGCCACGTCG ACGGCATCGT GCGGGTCCCG TACGACCCGC GGCTCGACCC GAGCAGGGGG ACCCCGGTCC GCATCCCGCG GCTGCGGTGG GCGACCAGGA GGTCCTACCT CCGGCTGGCG GCCGAGACCG TCGACGCGCT CGCCGGCATC GCCAACGCTG ATGTCGGGAC ACCGGCCGGC GGAGAAGATC CCTCCTTTCA ACCGCAGTTC GAACGACCGA TACGCACCGC TCTGGTGTTG CCACGCAACG CCGATACTGA TCACGCAGAG GTGTCAGGCG TATCATTTGG CGACCTGCGG CACGGAAGCA CACCCACGCG GGTGTCCGGG CCGGATCGTC CGGGCGGCCA ACCGCCCGCA GGAAGGGAAC CACGATGA
|
Protein sequence | MSDVPQAPYG EEGDARPVSL IGARIPSSTP SAAELPARDG ESDDRSAARA ARRRNRRRGG PDDASATDDG DGTWQDQAFH LEDAVSGDPP PAGRSGMSGS GRTGMTRGTT PEPREPGQRR GRRLPRTPPA AGPVPPRSSV PPGTPVPPGD AARGGRAVPP RSPRPESPRA ADQARRFDAG AAAAVDSPDG SLDGSAPPRR PMLRRAATPR AVPPPAPMGR KDPSFPPATS AAAAATAAAA AGTGTTGTTA APGTPGGAGA AGSPGSSAGS GGYAGSSTPG STARPGSGGS GAGGPGSGAS GSGASGSGSS SAGGSGAMRS GGARSGGTGR AAGSGGSAAS RSQRPEAPEA PVRRSGGSGG TSAPPPSRRP TSAGPRRPGR PSALAPRTPA ASPPPAGPET GSTAWPAPAT GSPQADHPAA GSPPADPASG GSFGEPTPGM GPGPFDAQGP YDRSEPLAPG EAAGSPAPAW PSQGYEQADV PLVDPSLGGL PVELDGLPVE FGGLPMESRS PGGDQPGSGS YYDPWAPHPA PGPAPDAGND ESTGAPGWGA VDAWQEAGEP VSLGEPAQPG EAAPPGDPAG GSGLAERTRR GAAAGETETE TETDDSAAAG SSSTGGAPPA DEAASTDEAW AGHGVAEPER TSDVLPRPST ELVPAGHLPR PGTGADRAPA LTDQGPGRGL WPWQEPWADL DDLEPWEEIP LRITLSDRAA VALPGSWRIA VTSLFPYSGT TTLAGVVGLT LAGVRAEPVL AIDLYPGAAT PGFVPPTSSE SAEIDESRGD NLVARVGSRG TTTVADIARQ RRSTAKATPD ELRSLVGARR SGSVFDLDVL PVDRPVGDEE TTNALVPVDE PITPAVLRSA LGALGHAYPL ILMDAPATAP LTPEAIRAAD VILLVTLATA SDLEATLADL RDPQGSLAEV GVSARERIPS GQGPGHPPHE RTGPAVIAAV VSPRRGRPSP RTRTATARLA RHVDGIVRVP YDPRLDPSRG TPVRIPRLRW ATRRSYLRLA AETVDALAGI ANADVGTPAG GEDPSFQPQF ERPIRTALVL PRNADTDHAE VSGVSFGDLR HGSTPTRVSG PDRPGGQPPA GREPR
|
| |