Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1008 |
Symbol | |
ID | 5669422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1189908 |
End bp | 1191920 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239937 |
Product | transcription termination factor Rho |
Protein accession | YP_001505370 |
Protein GI | 158312862 |
COG category | [K] Transcription |
COG ID | [COG1158] Transcription termination factor |
TIGRFAM ID | [TIGR00767] transcription termination factor Rho |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.588232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGTCGG CCTCGTCGCC GACCTCCCAG CCCGCGTCCG CGCCGGCCTC CGTGGCCCCG GCCGCGCAGG GCGCTCCGCC CGCGGCGGCT CCCGCCGCGG ACGCGGTGAG CTCCGCGCCC CGGGCGGCGG CCGGCTCGGT CGCTCCCGTC CAGGAGACGA GCGGTTCCAG CGGCGCCGAG GCGACCACGG GCGGCTCCGT TTCCGGCTCC GCGGCAGAGA CGTCGTCGGC CCCGACCCGT GTGCGCGGGC GGCGGGGCGC CTCTCGTGGC GTCACGAGCC CCGCGGGCGA GCAGCAGACA CTGCCGACCG GTCCGGGGGG TGCGGAGTCG TCCGACGACA CCGCCCGCCC GCAGGCCGCC GCGGCTGCCG CGGACGGCCC GGCGGTGACC GCGCCCGCCG CGGCGCCCGT CTCCGTCCGC GCCGGGACGA ACGGCTCCGA GAGCCCCACG CGCGGTCGTG ACGACCGCCG CGAGCGCTCC GGTGACCGGG ACCGCTCCGG CGGCGACCGG GACCGTTCGG GTGACCGTGA CCGTTCCGGC GGCGACCGGG ACCGTTCCGG CGACCGCCAG GGCCGCTCAC AGCCGGCCGG CGACCGCGAC CGGAACGACC GTGCCGACCG CAGCGACCGT GCCGACCGTG CCGACCGTGC CGACCGTGCC GACCGCAGCG ATCGTGCCGA CCGCAGCGAC CGTGCCGGTT CAGACCGGTC CAGGACGGTC GAGCGCACCC AGCCGGGCGA CCGTGCACCC CAGGGCGGCG TCCAGGACGA CGACGAGTTC GGCAGCCGGC GCCGCGGCCG GTTCCGGGAG CGCGGCCGCA ACCGCGGCCG CGGCGGGCAG GGCGGCACGA CCGAGACCGA GCCGACGGTG CGCGAGGACG ACGTCCTCGT CCCGGTGGCC GGCATCCTCG ACGTGCTGGA CAACTACGCC TTCGTCCGCA CGAGCGGCTA CCTGACCGGC CCGACGGACG TGTACGTGAG CCTCGCCCAG GTCCGTCGCA ACGGCCTGCG CCGCGGCGAC GCGATCACCG GGGTGGTGCG CGCGCCGCAG GAGGGCGAGC AGCGCCGCGA CAAGTACAAC GCGCTGGTCC GGCTGGACAC GATCAACGGG ATGGAGCCGG AGGAGGCCCG CGGCCGGCCG GAGTTCCACA AGCTCACCCC GCTCTACCCG CAGGACCGCC TGCGGCTGGA GACCGAGCCG CACATGATGA CCACGCGGGT CATCGACCTG GTGATGCCGA TCGGCAAGGG CCAGCGCGCG CTCATCGTGA GCCCGCCGAA GGCCGGCAAG ACGATGGTGC TCCAGTCGAT CGCCAACGCG ATCACCACGA ACAACCCGGA ATGCCACCTC ATGGTCGTCC TCGTCGACGA GCGGCCCGAG GAGGTCACCG ACATGCAGCG GTCGGTGAAG GGCGAGGTCG TCGCCTCGAC CTTCGACCGC CCGCCGGCCG ACCACACCAA CGTCGCCGAG CTGTCCATCG AGCGGGCCAA GCGGCTCGTC GAGCTCGGCC ACGACGTGGT CGTGCTGCTC GACTCGATCA CCCGGCTGGG TCGCGCCTAC AACCTCGCGG CGCCGGCGTC GGGGCGCATC CTGTCCGGTG GTGTCGACTC GACGGCGCTC TACCCGCCGA AGCGGTTCCT CGGCGCGGCG CGCAACATCG AGAACGGCGG CTCCCTGACG ATCATCGCGA CCGCGCTGGT CGAGACCGGT TCGACGATGG ACACGGTGAT CTTCGAGGAG TTCAAGGGCA CCGGTAACGC CGAGCTCAAG CTGGACCGGA AGATCGCCGA CAAGCGGGTC TTCCCGGCGG TGGACGTCGA CGCCTCCGGC ACCCGCAAGG AGGACATCCT GCTGGCCCCC GACGAGCTTG CGATCATGCA CAAGCTCCGC CGGGTCCTGC ACACCCGGGA GCCGCAGCAG GCGCTCGACC TCCTGCTCGA CCGGCTGAAG CAGACCAGGA CGAACTACGA GTTCCTGATG CAGATCGCGA AGACGGCACC GCCCCAGGAC TGA
|
Protein sequence | MPSASSPTSQ PASAPASVAP AAQGAPPAAA PAADAVSSAP RAAAGSVAPV QETSGSSGAE ATTGGSVSGS AAETSSAPTR VRGRRGASRG VTSPAGEQQT LPTGPGGAES SDDTARPQAA AAAADGPAVT APAAAPVSVR AGTNGSESPT RGRDDRRERS GDRDRSGGDR DRSGDRDRSG GDRDRSGDRQ GRSQPAGDRD RNDRADRSDR ADRADRADRA DRSDRADRSD RAGSDRSRTV ERTQPGDRAP QGGVQDDDEF GSRRRGRFRE RGRNRGRGGQ GGTTETEPTV REDDVLVPVA GILDVLDNYA FVRTSGYLTG PTDVYVSLAQ VRRNGLRRGD AITGVVRAPQ EGEQRRDKYN ALVRLDTING MEPEEARGRP EFHKLTPLYP QDRLRLETEP HMMTTRVIDL VMPIGKGQRA LIVSPPKAGK TMVLQSIANA ITTNNPECHL MVVLVDERPE EVTDMQRSVK GEVVASTFDR PPADHTNVAE LSIERAKRLV ELGHDVVVLL DSITRLGRAY NLAAPASGRI LSGGVDSTAL YPPKRFLGAA RNIENGGSLT IIATALVETG STMDTVIFEE FKGTGNAELK LDRKIADKRV FPAVDVDASG TRKEDILLAP DELAIMHKLR RVLHTREPQQ ALDLLLDRLK QTRTNYEFLM QIAKTAPPQD
|
| |