Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2780 |
Symbol | |
ID | 5671169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3287231 |
End bp | 3290674 |
Gene Length | 3444 bp |
Protein Length | 1147 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241689 |
Product | hypothetical protein |
Protein accession | YP_001507109 |
Protein GI | 158314601 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCTTG ACGACCTGAC CTGGCGGATC CGCGACCGGA TCGAGCAGTA CGACCGGACG AGGCAGGTCG CGGAGATCCT CGACGCCGAC GCCGACCGGG ACGCCGCCGC CCTGTGGCGG CTGCTCCACG CCGTCGACCT CACCACCTCC TCCCCCGCGC TGGACGCGGC GTTCTCCATC GCCAGCGCCA CCCTGGGCCG GTTGCACCAC CGCCGTTACC AGCTGCTGCC CGTGGGCGCC GGTCTGTCCG AGCTGGCCCG GTGCCTGCTG TGCCTGGAAC CGATCTCCGA CGATCACGAG GCCGTCCCGT CCGAGCTCGT GCCCGTGGTC GGGCGGTTCA CAGATCCCGA CGTCCAGGCC GCGCTGGGCG TGCAGCTGCT CGATGCCGCG GCCGGAGGCG AGGATCCCGC CCTGCTCGAC GCGGCCATCC TGCTTCTGGC CCCTGCCGCG ACCGCGCGGC CGCGGCAGGG TCAGGGGCCG CGGCGGGGGC CCGGTCGGAG CGGCCGGCTC TCGGCGCTGT CCACCGCGTA CCGCCGACGG CACGAACGCG ACGGCACCAC CACCGACCTC GACCGGGCCC TCGACACCGG CGAACGGGCC GTGAGGCTCG CGGACCAGGA TGGCGGGGCG CCGGAGGTGT CCGTCCAGGC GTGGACCGCC CTCGCCCGGG CCTACCGCTG CCGGTACCGG CTGCACGCCG ATCCCGCGGA CCTGCAGCGC GTCATCGACC TGTCCGAGCG GGCCCTGGCG CATACCGGCC CCTCGGCGAA CCAGCTCGCG GACCTGGCCA CCGCCTATCT GCACCGGCAC GAGCACACCG ACTCACCCGC GGACCTGGAA CGGGCCGTGG ACCTGGCGGA GGACGCGGCC GCGCTGCCCG GCGGGCAGGA GGACCCCGAC GTTCTGTCCG CCCTCGGCCG CGCGCTGTTG CGCCACTATG ACCGGTCCGG GCAACGCTCC GAGCTGTGGC GGGCGGCCAC CCTCGCCGAG CAGGCCGCGG CTGCGCTGTC ACCACGCGAC CCGCGGCGCG CCACCTACCT GTGCGCCGCC GCCGCGACCC TGCTTCGGCG GCACGAACGC AGCGGGGCAC TCGGCGACCT GAACCGCGCC GTCGACCTCG GCCGGCAGGC CCTCGCGGCG ATGCCGGAGA CCGACCCCGC CCGGGCAGAC GCCCTCGGCC GGCTCGCCGC CGCGCTGCAC CGGCGTCATC GCAGCGCCGG CGCGGACACC GACCTCGACC AGGCCGAGGA CCTGGCGAGC TGGGCCCTGG CAGCGATCCC GCCCGGGCAT CCGGACCGGG CCGGTGCGGC CCTGGAACGC GCGGCCGTCC ACCTGACCCG CTACCGCCAC AGCGGCGTGA CCGCCGAGCT CGCGCGTGCG ATCGAACTCG GCGAGCAGGT CACGGCGACG GACAGCACCT CCCTGCCGGG ATGGTGGTCG CTTCTGGGTG ACGCCTACCA GCAGCGTCAT GCGATCAGCG GCGAGGCCAG CGACCTGGAC CGGGCGGTGG AGCTCGGCGA GCGGGCCCTG GCGGCCACCC GCGAGGACGA CGTGGCACGC GCCGAGCGGT ACGCCCGGCT GGCCACCGCG CACTGGCGCC GGCGCAGCCA CACGCCGGGT GGCGCCGACC TGGACCGGGC GATCGACCTG AGGGAACGGG CCGTCGCCGG CACCCCCGCC GACCACCTGG ACCTGCCGGA TCGACTGGCC GACCTCGCCG CCGCCCACCT CGACCGCTAC CGTCTCACCG GCGCGGCCGC CGACCTCGAC ACCACTGTCA CTCTGTGCGA ACGGGCTCTG GTGGCGCTCC CGGTCGACCA TCCACACCGC TCCCGGTTCA CCGCCAGCAT GTGCGTCGCC TACCTGCAAC GGATCGCCGG CGCGGGCCAG GCCCCGGACC GGTCACGACT GCGGGAGCTC GCCGACGGGA TGACCGGCGC CCAGGGCGCC GCCCCCGCCG ACCGGGTGTC TGCCCACCAT GCCGTGGGCC GGCTCGCGCA GAGCGCCGGG CAGCCGGCGC TCGCCCTCGC GATGCTGGAC GCGGCCGCCG CTCTCCTGCC GTCGGTGGCT CCCCGCGAGG CGGGCTGGGC CGACCAGCAG TACCGGCTCG GTGAACACGG CGGCCTCGTC GGAGCCGGGG TGGCCGCGCA CTGCGCGGCC GGTGACCCGG CGGGCGCCGT CGAGTTCGCC GAACTCGGCC GCGGGGTGCT CCTGGCGAGC CAGGCCAACA CCCGGGCCGA CCTCGACGAG CTCGACGACC GGGCACCACG GCTCGCCGCC CGCTTCCGCT GGGTCTGCGA GCGGCTCAAC ACCCCCGACT TCCCCGCCGA CGAACGCCGC CGATGGTGGG CCGACTACGA CCGGCTCCTC GTCGACATCC GTGCGGTCCC CGGCCTCATG CACTTCGTCG CGGCGCCGCA ACTGGCGGAG CTGGCCCCCG CCGCCGCGGG TGGGTGCGTG ATCCTCGTCA ACGCCGATAC GCACCGAAGC GACGCCATCC TCGTGCGGGC CGACACCGAC CCCGTGTCCG TCGCGCTGCC CGACCTGCGG CCGTCCGACG TCAACAAGCA GGTCACCGCC TTGCTCGCCG CCCTCAACAG CGGCTCCACC CTGGCCGGGG CACTGCGTCG GCGTCTGGTG GTGACCGCGG TGCTGGGCTG GTTGTGGGAC GTCGTCGTGG CCCCGGTCGC CGCCGCCCTG CCTCCCGGCG ACACTGCCCA GCGGGTGTGG TGGCTGCCAA CCGGGCTCCT CGGACTGCTG CCGTTGCACG CCGCCGGCCA CCCCGGCCAG GACGGCGCTC TCGACACCAT GATCTCCTCC TACATCCCCT CGCTGCGGGC ACTGCGGGCC GCCCGCAGCC GCCCGCCGGC CCGACGGCGC CAGAACCTGT CCGTCGTCAT GAGCGCCACC CCGGACATGC CGGAGCTACC CGGCGCCGAA AAGGAGGCGG CCGTGGTGGA CGGCCCGTCC CTGCTCAACG CGGACGCGAC CGCAGATCAA GTCCTGACCG CGCTACGGCA GACAACCTGG GCGCATTTCG CCTGCCACGG CGTGATCAAC GCGACCTCGC AGGTCGACAG CGGTCTGCGG GTGCACGACC GCATCCTGAC ACTGCCCGAG ATCGGCGGTC TGCGGCTGAC CGACGCCGAA CTCGCCTACC TGTCCGCCTG CTCCACCGCC AACCACGGCA CCCGCTACGC CGACGAGGTG CTGCACCCGG CCGCCGCCTT CCAGCTCGCC GGTTTCCGGC ATGTGGTGGC CAGCCTGTGG CCACTCGCCG ACGGTGACGC CGTGGACGCC GCCCGCGCGT TCTACCAGCA TTTCGCCGAC ACTCCGGTCG CCGACCAGGC AGCCCCCGTG CTGCATACCG TCACCCTGCG TCTACGGGAC CAGTATCCAG AACGCCCCGA CCTGTGGGCA CCACTCGTCC ACAGCGGCCC CTGA
|
Protein sequence | MPLDDLTWRI RDRIEQYDRT RQVAEILDAD ADRDAAALWR LLHAVDLTTS SPALDAAFSI ASATLGRLHH RRYQLLPVGA GLSELARCLL CLEPISDDHE AVPSELVPVV GRFTDPDVQA ALGVQLLDAA AGGEDPALLD AAILLLAPAA TARPRQGQGP RRGPGRSGRL SALSTAYRRR HERDGTTTDL DRALDTGERA VRLADQDGGA PEVSVQAWTA LARAYRCRYR LHADPADLQR VIDLSERALA HTGPSANQLA DLATAYLHRH EHTDSPADLE RAVDLAEDAA ALPGGQEDPD VLSALGRALL RHYDRSGQRS ELWRAATLAE QAAAALSPRD PRRATYLCAA AATLLRRHER SGALGDLNRA VDLGRQALAA MPETDPARAD ALGRLAAALH RRHRSAGADT DLDQAEDLAS WALAAIPPGH PDRAGAALER AAVHLTRYRH SGVTAELARA IELGEQVTAT DSTSLPGWWS LLGDAYQQRH AISGEASDLD RAVELGERAL AATREDDVAR AERYARLATA HWRRRSHTPG GADLDRAIDL RERAVAGTPA DHLDLPDRLA DLAAAHLDRY RLTGAAADLD TTVTLCERAL VALPVDHPHR SRFTASMCVA YLQRIAGAGQ APDRSRLREL ADGMTGAQGA APADRVSAHH AVGRLAQSAG QPALALAMLD AAAALLPSVA PREAGWADQQ YRLGEHGGLV GAGVAAHCAA GDPAGAVEFA ELGRGVLLAS QANTRADLDE LDDRAPRLAA RFRWVCERLN TPDFPADERR RWWADYDRLL VDIRAVPGLM HFVAAPQLAE LAPAAAGGCV ILVNADTHRS DAILVRADTD PVSVALPDLR PSDVNKQVTA LLAALNSGST LAGALRRRLV VTAVLGWLWD VVVAPVAAAL PPGDTAQRVW WLPTGLLGLL PLHAAGHPGQ DGALDTMISS YIPSLRALRA ARSRPPARRR QNLSVVMSAT PDMPELPGAE KEAAVVDGPS LLNADATADQ VLTALRQTTW AHFACHGVIN ATSQVDSGLR VHDRILTLPE IGGLRLTDAE LAYLSACSTA NHGTRYADEV LHPAAAFQLA GFRHVVASLW PLADGDAVDA ARAFYQHFAD TPVADQAAPV LHTVTLRLRD QYPERPDLWA PLVHSGP
|
| |