Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1182 |
Symbol | infB |
ID | 5669595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1405214 |
End bp | 1408354 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240114 |
Product | translation initiation factor IF-2 |
Protein accession | YP_001505542 |
Protein GI | 158313034 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0532] Translation initiation factor 2 (IF-2; GTPase) |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR00487] translation initiation factor IF-2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.195294 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGGAA AGGCCCGCGT ACATGAGCTC GCGAAGGAGC TCGGCGTCGA CAGCAAGACC GTGCTCGCCA AGCTCAAGGA TCTCGGTGAG TTCGTGAAGT CCGCCTCGTC CACAGTCGAG GCACCCGTGG TTCGGAAACT GAAGGAGGCC TTCCCCGCGG AAGGCTCCGC TCCGTCCTCG CGTCCCGGCG GTCGTCCGGG CAACGGCGCA CGACCGATGC CGCCGCCCCG TCCGGGGCCC GCCATCGGCC GTCCCGGCCC AGGTTCCGGC ACCGGTCCCC GGCCCGGCCC TGGTGGCCGT CCGGTTCCCG GTCGCCCCGG TCCGGCCCCG CTGCCCGGGG CATCGCGCCC GTCCACCCCG ACGGCCGCGC CCCAGTCCCA GCCGGCTCAG ACCCAGCCGC CCCAGTCCCA GCCGGTGGCG CCGCAGCCGT CGCAGGCTCC GCGCCCCGCC GCGGCCGCGG CATCCGCCGC GGCGCCGGCC CCGGCACCCT CGGCTCCCGC CCCGGCACCC TCGGCTCCCG CCCCGGCTCC GATCACCTCG GCCCCGACGG CCGCCACGCC GCCGGCCGCA CCGCAGCGGC CGACTCCCGG TGGCCCGCGC CCGGGTCCCG CCGCGCCGGG TCGTCCGCGT ACCGGCGGCC CCGGTGGTCC TGGTGGCCCC GGCGGCGGCC CGCGTCCCGG GCCCCGGCCC GGCCCGCGGC CGGCCGCGCC CGGTAACAAC CCCTACACCT CCCCGGCGGC CGGCCCGCGT GCCGCGTCCG GTCAGGGTGG TTCCCCCTCG GCACCGCCGC GTCCCGGCGC GCCGCGTCCG GGCGGTCCGC GTCCGGGCGG TCCGCGTCCG GGTGGGCCCG GTGGCCAGCG TCCGTCGCCG GGGCAGATGC CCCCGCGTCC GGGCGGCTCC GGCGGTCCCC GGCCGACCCC GGGCCAGATG CCCCCGCGTC CGGGCGGCTC CGGTGGCCCG CGGCCGAACT CGAACATGTT CCAGCCCCGT CCGGCGGGTG GCGCCCCGGG CCGCCCCGGT GGCGGTGGCG GCCCCGGCCG TCCCGGTGGC GGTGGCGGTG GCCCGCGTCC CGGTGGCGGT GGGTTCGCTC CCCGTGGGGG AGCCCCCGGC CGTCCCGGTG GCGGCGGTGG CGCTCCGGGT CGTCCCGGTG GCGGCGGTCC CGGTGGCGGC GGTCGTCCGG CCGCCGGTGG CCGTGGTCGC GGCGGGACGA CGGCGGGTGC CTTCGGTCCT GGTGGCCGTG GGCGTCCCGG CCGGCAGCGC AAGTCCAAGC GCGCGAAGCG GCAGGAATGG GAGAGCGGGC TCGAGGCGCC GCGCATGGGG GCGATGGTCC CGCGCGGTAA CGGGCAGGCG ATCCGGCTGC CGCGTGGCGC GTCGCTCGCC GACTTCGCGG ACAAGATCGA CGCGAACCCC GGTGCCCTCG TCCAGGTGGT CTTCACCCAG CTCGGCGAGA TGGTCACGGC GACGCAGTCC TGCACGGACG AGACCCTGCA GCTGCTCGGT GTGACCCTGG GCTACGAGGT CCAGATCGTC AGCCCCGAGG ACGAGGACAA GGAGCTGCTG GAGAGCTTCG ACCTGTCCTT CGGCGGCGAC TACGCCGACG ACGTCGAGCT GTCGGCCCGT CCGCCGGTGG TGACCGTCAT GGGTCACGTC GACCACGGCA AGACGAAGCT GCTCGACGCG ATCCGGTCCA CGGACGTCGT CGGCGGCGAG GCCGGTGGCA TCACCCAGCA CATCGGCGCC TACCAGGTTC GTGCGGTCGT GGACGGGACC GAGCGCCCGA TCACCTTCAT CGACACCCCG GGCCACGAGA CCTTCACCGC GATGCGTGCC CGTGGTGCGC AGGTGACGGA CATCGTGGTC CTGGTGGTGG CCGCCGACGA CGGTGTGAAG CCGCAGACGA TCGAGGCGCT GAACCACGCG CAGGCGGCCA ACGTGCCGAT CGTGGTGGCA GTGAACAAGG TCGACAAGGA GGGCGCGGAC CCGGCGAAGG TCCGCGGCCA GCTCACCGAG TACGGCCTGG TCGCGGAGGA GTACGGCGGC GACACGATGT TCGTCGACGT CTCGGCCCGC AACCGGACGG GCATCGACGA GCTGACGGAG GCGGTCATCC TGACCGCCGA CGCCTCGCTC GACCTGCGCG CCCCGACCGG TACCGAGGCT CAGGGCGTCG CGATCGAAGG TCGTCTCGAC CGCGGTCGCG GCCCGGTGGC CACGGTGCTC GTCCAGCGTG GCACGCTGCG TATCGGTGAC TCGGTCGTCG CCGGCGAGGC CTTCGGCCGC GTCCGGGCGA TGCTCGACGA GAACGGCGCC CAGGTGTCCG AGGCGGGGCC GGCGCGGCCG GTGCAGGTCC TCGGTTTCAC CAGCGTCCCC GACGCCGGCG ACAACTTCCT GGTTGTGCCG GAGGACCGGG TCGCCCGCCA GATCGCCGAG CGCCGGCAGG CCCGCGAGCG CAACGCCGAG CTGGCGCTGA GCCGTGGCCG TCCGACGCTC GAGACGATCC TCGAGCGGAT GAAGGAGGGC GAGAAGACCC AGCTCAACCT CATCCTCAAG GGCGACGTCT CCGGTTCGGT GGAGGCCCTC GAGGACGCCC TGCTCAAGAT CGACGTGGGC GACGAGGTGG GTCTTCGGAT CATCGACCGC GGTGTCGGCG CGATCACCGA GACCAACGTC ATGCTGGCGT CGGCCTCCGA CGCGGTCATC ATCGGCTTCA ACGTCCGGCC GCAGGGCAAG GCGACGGAGC TGGCCGACCG CGAGGGCGTC GAGGTCCGCT ACTACTCGGT GATCTACCAG GCCATCGAGG ACATCGAGAA CGCTCTCAAG GGCATGCTCA AGCCGGTGTA CGAGGAGGCG CAGCTCGGCA CCGCCGAGGT GCGCGAGGTC TTCCGCGTAC CGCGGGTGGG CAACGTCGCC GGTTCGCTGG TCCGGTCCGG CATCATCCGC CGCAACACCA AGGCCCGCCT CATCCGGGAC GGCGTCGTGG TCGCGGACAA CCTCACCGTC GAGTCGCTCA AGCGGTTCAA GGACGACGCG ACCGAGGTCC GCGAGGGCTA CGAGTGCGGT ATCGGGCTCG GCTCGTTCAA CGACATCAAG GTCGATGACG TGATCGAGAC CTTCGAGCAG CGGGAGGTTC CCCGCACCTG A
|
Protein sequence | MAGKARVHEL AKELGVDSKT VLAKLKDLGE FVKSASSTVE APVVRKLKEA FPAEGSAPSS RPGGRPGNGA RPMPPPRPGP AIGRPGPGSG TGPRPGPGGR PVPGRPGPAP LPGASRPSTP TAAPQSQPAQ TQPPQSQPVA PQPSQAPRPA AAAASAAAPA PAPSAPAPAP SAPAPAPITS APTAATPPAA PQRPTPGGPR PGPAAPGRPR TGGPGGPGGP GGGPRPGPRP GPRPAAPGNN PYTSPAAGPR AASGQGGSPS APPRPGAPRP GGPRPGGPRP GGPGGQRPSP GQMPPRPGGS GGPRPTPGQM PPRPGGSGGP RPNSNMFQPR PAGGAPGRPG GGGGPGRPGG GGGGPRPGGG GFAPRGGAPG RPGGGGGAPG RPGGGGPGGG GRPAAGGRGR GGTTAGAFGP GGRGRPGRQR KSKRAKRQEW ESGLEAPRMG AMVPRGNGQA IRLPRGASLA DFADKIDANP GALVQVVFTQ LGEMVTATQS CTDETLQLLG VTLGYEVQIV SPEDEDKELL ESFDLSFGGD YADDVELSAR PPVVTVMGHV DHGKTKLLDA IRSTDVVGGE AGGITQHIGA YQVRAVVDGT ERPITFIDTP GHETFTAMRA RGAQVTDIVV LVVAADDGVK PQTIEALNHA QAANVPIVVA VNKVDKEGAD PAKVRGQLTE YGLVAEEYGG DTMFVDVSAR NRTGIDELTE AVILTADASL DLRAPTGTEA QGVAIEGRLD RGRGPVATVL VQRGTLRIGD SVVAGEAFGR VRAMLDENGA QVSEAGPARP VQVLGFTSVP DAGDNFLVVP EDRVARQIAE RRQARERNAE LALSRGRPTL ETILERMKEG EKTQLNLILK GDVSGSVEAL EDALLKIDVG DEVGLRIIDR GVGAITETNV MLASASDAVI IGFNVRPQGK ATELADREGV EVRYYSVIYQ AIEDIENALK GMLKPVYEEA QLGTAEVREV FRVPRVGNVA GSLVRSGIIR RNTKARLIRD GVVVADNLTV ESLKRFKDDA TEVREGYECG IGLGSFNDIK VDDVIETFEQ REVPRT
|
| |