Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0043 |
Symbol | |
ID | 5668469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 51034 |
End bp | 56460 |
Gene Length | 5427 bp |
Protein Length | 1808 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641238972 |
Product | hypothetical protein |
Protein accession | YP_001504417 |
Protein GI | 158311909 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAGCG TCATTGGCAA GGTGCACGGG CCGGACGGAT CGTCGTTCGT CGGGCACAGT GTCCGCCTCC GTTACACGGT CGACCTCGGC GACCGCGATG CGACGCCGCC GACGGCACTG CCCGTGCGAC TGGAGTCGGC CGCCCCGCTC GCGGCCGACG GCGACTTCGC GCTGGCGATC GCCGACGCGC CGCTCGTCGG GACGGCGACG GTGACCATCG TCTCGCCGGC AGGCGTGACG ATCAACGAGG GGACCCTGTC CGTCGACGCG CTGGCCAACC CGGTGATCGT CTCGGCGACG CCGGCGGCGC CACTGCCCGT GCTGCCTAAC CATGAGCCGG GCACCGGGCC GCGGGTGCGT CTGACCGGGC GGGTGATCGA CGACCGGGGC CACCGCGTTT CCAGCGGCCT GCCGGTCGTG ATCTCCGGGC GCCGGGTCGA CCAGAACGGC GACCCCGAGA TCGCCGACCG CGTCCTGCTC GTCACGGACA CCCAGCCGGG CGGGACGTTC GGCGCGGACT GGCCGACCGA CGAGCTGGCC ACAGCGCACG GTGTGGTCAA GGGCCTGCCG CCGGTGCCGA TCCCATTGGT CGGCGGCCGC CTGCCGCTCA AGGTCCTCCT CGTCGTCAGC CTCCCCGACG ACGAACGCGA GGACTGCGCT TGCGACAACA CCGCACCGCG CGCCCCCGAC CCGTCCGATC TGACGAGCAA CCCGGCCGCG TTCTCGCAGG ACATGGGCGG CACGTGCGTC GACCTGACCA CGCCGAACCG GGTGGTCGAG GAGTTCGCCT ATTTCTTCGT CGTGCGCACG TCGGAGCCGG AGATCCGCGG GACCACGCTC GGCACCCGTC GGGTGCTGCC GCCGGCGGTA ACCAGGGAGC TACTCGGCGC CGTGCGCGAG GCCCTCCCCG CGGCGGAAGG AGCAAGCACT CGCGCGATCC GTGGCCTGTC GCTGGACACC GCCGCGCTGA AGTCGCTCGT CGCCACAGAC GCGATCCCCT CGGTCGACCA GCTGCAGGTC GCCGCCTACC ACTCCGAGGT GGCCGCGGTG TCCCGGCTCG TCGACGCGCT GCGCCGCCGC CAGCCCGGAC GCGTCGCCCT CGACGCCTCC TCCGCCGTCG ACTGGGACGA CACGCCGACG ATCTACCAGG CCGCCTCGGT CGCCCACGGC CACATCCTCC AGTACCGCGA GGTGTGGCGG GCCGACGGCT ACTCGCTCGG CGACCTGCTC TACTCGCTGC CGCTCGCGCC CGGCCAGAAG CGCAAGCTCG CTGTGCTCGA CTGGGAGCGC CAGACCACCT CAGCGCGCAC CGAACGACTC GACTTCGAGG AGCAGCTCGA CGCATTCGCC GAGCGCGATC GCGACATTAG CGAGATCGTC GGCTCGCACC TGAACGAGGA GTCGAGCGGC GGCTCTCGCT CGAACACGTG GGGCGTCGCC GGCGGCATCG GTGCGGGGTT CATCGGCTCC GGTTTCGGCA TCTTCGGCGG CGTCGCCGGT GGCGCCGGCG GGTCCAGCGC GCGGGCGTGG CAGGACAACG CCCGCGACCT GTCGGCCAAC AGCCTCCAGC AGCTGCGCGA CCGGATGGTG CAGCGCTCCG CTGCGGTGCG CGATGTGCGC TCGACCACCG TGCAGACCGT CGGGCAGGGC GAGACCGTGC GCGCCGAGAC GGAGACGGTG GCCAACTACA ACCACTGCCA CGCGATGACG GTGGAGTACT TCAACGTGCT GCGCCACTTC CTCGTCACCC ACGAGCTCGC CGATGTGCGC GAGTGCCTGT TCGTCCCGCT GCCGATCTCC GAGTTCGACC GGGCCAAGGC GATGCGCTGG CGCTCGGCGC TGCAGCGCTT CGTGTCCGAT CCGACACTGC GCCGCGGCTT CGACGCGATC GAGCGGATCA GCCGCAACTG GGTCGGCTAC GACTTCCCGG TCGACCGCTA CAGCGAGGAG GCCCCGGAGT CGCTCGAAGG TGAGCTGTAC GTCAGTTTCC TGCTGCCCCG CCCACGCGAT GCGGCCGACG GCACGTTCCA GGTCGACATG TGGAAGTCGT GCCAATGGCT GCTGCCGATC GACGCGCTGG AGCTGTTCAC CGCCAAGCTC GCGGCACGCG CCCAGCGAGA ACGAGACCTC GTGTTCCGCA ACGAGATCGC CCCTGGGATC GCCGAGCAGC TCGTCCAGCA CCTGCGCTTC GCTTACGTCA CCGCGTCCGG CGCGCAGCTC GGCGTCCGTC TCGACCCGAC GCTCGTGTCG CGCTACGCCG AGTCGACCCC GCTGTACGTC ACGCTGCGCC CGTCCGGCGC GGTGCCGGCG ATCCCCCGTG AGGATATCGC CCGCTTCCGC ATCTGGTACG ACGGCCCCGC GCTGCCGTCG GCGGCGCGTG TCATCGTCCA CCGCGGCAAG GTGCGCTACC GCACCGAGCA CCTCCAATAC CTCTTGTTCG ACGAGCCCCG CCTGCTCAAC GACATCTCCT CGACCGACGA CATCTACGTC GCCACGCCGC TCTCGCGCGC CGAGCTGCGC AGCCCCCGCC GTGAGGACAT CGAGGCAGCC GACCGGTTGG TCAAGCACCT CAACACCAAC ATCGAGCGCG CCCACCAAGC GATCTGGGCG TCGATGGACG AGAACCGCCG TTACCTGTTG CTCGACGGCG TGATCGCCCC GAACAGCGGT GGGCGCAGTG TCGCCAGCGT CGTCGAGAAC CGGTTGATCG GCATCGTCGG CAACTGCCTG GTGATGCCGT GCGGGCCAGG GATCCAGCTC GATCCGAGCC GAGCCGTGCC GCAGGCGCGG ACCGACGGCG AGCCCGTCGA GCCGCAGCCG CTGATCGACC TGTACGCCAC GGCGCCGCTG CCGTCGACTC GGATCAGCCT GCCGACGCGT GGCGTCTACG CCGAGGCGGT GAGCGGGGAG TGTAACGCCT GCGAGCGCAT CGACGATACC CGCTTCTGGC GCTGGGAGGA GTCTGCGATC ACGGACACCC CGCCGGACAT CGCCGCCCTG TCGACTGCCA GCCGGGCGAG CGACGAGCCA GACCTCACGC CCACACCGCT GCCGGCGGCG ATCGTCAACA TCCAGCAGCC AGCCAGCCTG CCCGACCCGC TCGGCCTGTC GGCGGCGATG AAGGTGCTGG CGCGCAACGA CCTGTTCAAG GACATCACCG GGTTGGAGGG CAGCCAGAAG AACGCGCTCG CCGGGTTCAA GGCGGCACTG GGCATGGCCC AGACACTGGG TGGCGAGGCG GCCAACCTCG CCCGACAGAA CGAGTCGGCG CGCAACGCCG ACCGGCTGAT GGACAGCATC CAGCAAGCCC AGCGCGACAA GCTGATCTCG CCGGATCAGG CCCAGCAGTT GACCCAGTCG GTGCTGCGCA GCGCGTCGGG TGAGGCATCG GCGAAGGACC GCCCGGCCGC GCCGGCGGAG GACCCGGGCG TGCAGAAGGC GATCGACTCG GCCGCCGACT CGCCGCGCGC CACAGTGTCG GTGACCACGC CCACCGAGTC GCTGGACGCG ACGTTCGACG GCGGACCGAA GCCACCGGTG ATCGGCGGCG CGCCGGCGAT CGGCGGCACC ACGCTGACGT TCGACCTCGC CATCCCGTTC ATCGACGACT GGACACAGCC GGCACCCGGC ACGCCGCCGG GACCGCTCAC CCGCCAGCGC TGGACCGAAA CGCGCAAGAT CGCCAAGCTG AGCACCGCGA GCTCGGTCAC CCAGCGGACC GGTCCGACCA GCTTCAGCCG GTTCGACCTG GCCGCCGCGG CGTTGTCCAA CGGCCTGATC AAGCGCGAGG CTCCCGGCTC CGACACGTTC GTCGTGCCGA CCAAGATGCG CCTCGCCCAC CCGGCGCCGA GCGCCGGCTC CAAGGTCGTG GCCGGTTCGG GCGCACTGCC GCTGGTCGTC TTCCTCCACG GCAACCACCA GGCCTGGGAC TTCACCTTCG GACCGCCGAC GTCGACGATG ACGGCGCTGG ACGGCGCTGG CAACCCGGTC ACCGTGGACA TTGTGGACAC GACGACCGCC TTCACGGTGA CACCGAACCA TGAGGGCTAC GCCTACCTGC AGGACTCGCT CGCCGCCTCG GGGTACGCCT CGATCTCGAT CGACACGAAC TTCGCCAACA CGTTCAACTC GGCGATCGAC ACTCGCGCCC TGACGGTGCT CGCCGCCCTC GACCGGCTGC GCACCGCGGC TGAGACGAAG GGCAACGCCT ACTTCGGCAG GTTCGACTTC CACAAGGTCG TACTGGTCGG GCACTCGCGT GGCGGCGACG CCGTGGTCCA GGTGGCCAGG CTGAACGCCA AGCGGGTGAC GAAGAAGTAC GGCGTGCTCG CGGTGTGCTC GCTCGCGCCG ACCGACTTCA GCGGCACGGC CGCGGCCGGC GACGGGCCGT TCGTCCTCAC GCCCAACGAG ACCGGGTTCT TCCTGGCGAT CCAGGCCGCG CTCGACGGCG ACGTCTCCGG CGTCGGCGGG GCCACTGCCG GGACCGGCAC GGTGTTCCGC CACTACGCCA GGGCCACGTG CCAGAAGGCG CTCGTGCAGC TCACCAAGTG CTGCCACAAT CGGTTCAACA CGGTGTGGTC GAAGGAGCCG ATCGACCGTG CCCACGACGA CTCCGCGCTC GTCGACGCTG AGTTCAACGC GCTGCACACG CATGCGACGC ACCGCCAGCT GTTCACCGAG TACCTCACGG CGCTGCTGGA GAAGCAGGTG CGCAAGAACT CGAAGGACGT CGCCCTGTTC ACCGGTCAGC GGTCCAACAG CGCCGGCGTC GACGCCGCCC ACCAGTGGGC GTTCGGGCGA ACCGTGACGT TGATGGACAG CTTCGAGGCA GCCGCCAGCG ATCTGGGCGG CGCACGCACC CTGACCGCCG CCGCGGTCGG GCCGTTCCTG GACGTCAAGG TGCCGGAGAG CACGCTCAAG CGGGGCGGCA ACGCGGGGAC GCTCGTAGCG ATCACCGAGC ACGCCACGCA CGTCACCAAG CTCGCCAACC TCGACACCGC CGTGGTCGCG TCAGGCACGG CGTTCACCAT CGACGTGCCA TCGGCCAAGC GCGACTGGAG TGGCTTCGAG GTCGTCAACC TCGACGTGGT CGGCACCTTC GACCTCACCA AGGCGTCCAG CCAACCGGTG CCGGTGCTGA CCGTGACGCT CACCGACGGG TCGGGCACCA CCAAGCAGCT CACGGCGCGC AACGTGCTCA ACCAGCCGAG CACCCACATC GTCAACTACG ACCGTGCCGT CCCCAAGGAT CCAACCAAGG AGCAGGACGC CACCCTGTTC CGGCTGGAGA CGCTGAGCTT CGAGCTGCCG CGGACCAGCA CGGTCGGCGG GATCGACTCG GGCGACATCG TCTCGCTGAC CGTCGACGTC GACCTGTCCG TCGGCAACAC CGTCCTGATC GATTCGATCA CGCTCGTCGC GCTCTGA
|
Protein sequence | MESVIGKVHG PDGSSFVGHS VRLRYTVDLG DRDATPPTAL PVRLESAAPL AADGDFALAI ADAPLVGTAT VTIVSPAGVT INEGTLSVDA LANPVIVSAT PAAPLPVLPN HEPGTGPRVR LTGRVIDDRG HRVSSGLPVV ISGRRVDQNG DPEIADRVLL VTDTQPGGTF GADWPTDELA TAHGVVKGLP PVPIPLVGGR LPLKVLLVVS LPDDEREDCA CDNTAPRAPD PSDLTSNPAA FSQDMGGTCV DLTTPNRVVE EFAYFFVVRT SEPEIRGTTL GTRRVLPPAV TRELLGAVRE ALPAAEGAST RAIRGLSLDT AALKSLVATD AIPSVDQLQV AAYHSEVAAV SRLVDALRRR QPGRVALDAS SAVDWDDTPT IYQAASVAHG HILQYREVWR ADGYSLGDLL YSLPLAPGQK RKLAVLDWER QTTSARTERL DFEEQLDAFA ERDRDISEIV GSHLNEESSG GSRSNTWGVA GGIGAGFIGS GFGIFGGVAG GAGGSSARAW QDNARDLSAN SLQQLRDRMV QRSAAVRDVR STTVQTVGQG ETVRAETETV ANYNHCHAMT VEYFNVLRHF LVTHELADVR ECLFVPLPIS EFDRAKAMRW RSALQRFVSD PTLRRGFDAI ERISRNWVGY DFPVDRYSEE APESLEGELY VSFLLPRPRD AADGTFQVDM WKSCQWLLPI DALELFTAKL AARAQRERDL VFRNEIAPGI AEQLVQHLRF AYVTASGAQL GVRLDPTLVS RYAESTPLYV TLRPSGAVPA IPREDIARFR IWYDGPALPS AARVIVHRGK VRYRTEHLQY LLFDEPRLLN DISSTDDIYV ATPLSRAELR SPRREDIEAA DRLVKHLNTN IERAHQAIWA SMDENRRYLL LDGVIAPNSG GRSVASVVEN RLIGIVGNCL VMPCGPGIQL DPSRAVPQAR TDGEPVEPQP LIDLYATAPL PSTRISLPTR GVYAEAVSGE CNACERIDDT RFWRWEESAI TDTPPDIAAL STASRASDEP DLTPTPLPAA IVNIQQPASL PDPLGLSAAM KVLARNDLFK DITGLEGSQK NALAGFKAAL GMAQTLGGEA ANLARQNESA RNADRLMDSI QQAQRDKLIS PDQAQQLTQS VLRSASGEAS AKDRPAAPAE DPGVQKAIDS AADSPRATVS VTTPTESLDA TFDGGPKPPV IGGAPAIGGT TLTFDLAIPF IDDWTQPAPG TPPGPLTRQR WTETRKIAKL STASSVTQRT GPTSFSRFDL AAAALSNGLI KREAPGSDTF VVPTKMRLAH PAPSAGSKVV AGSGALPLVV FLHGNHQAWD FTFGPPTSTM TALDGAGNPV TVDIVDTTTA FTVTPNHEGY AYLQDSLAAS GYASISIDTN FANTFNSAID TRALTVLAAL DRLRTAAETK GNAYFGRFDF HKVVLVGHSR GGDAVVQVAR LNAKRVTKKY GVLAVCSLAP TDFSGTAAAG DGPFVLTPNE TGFFLAIQAA LDGDVSGVGG ATAGTGTVFR HYARATCQKA LVQLTKCCHN RFNTVWSKEP IDRAHDDSAL VDAEFNALHT HATHRQLFTE YLTALLEKQV RKNSKDVALF TGQRSNSAGV DAAHQWAFGR TVTLMDSFEA AASDLGGART LTAAAVGPFL DVKVPESTLK RGGNAGTLVA ITEHATHVTK LANLDTAVVA SGTAFTIDVP SAKRDWSGFE VVNLDVVGTF DLTKASSQPV PVLTVTLTDG SGTTKQLTAR NVLNQPSTHI VNYDRAVPKD PTKEQDATLF RLETLSFELP RTSTVGGIDS GDIVSLTVDV DLSVGNTVLI DSITLVAL
|
| |