Gene Information |
Locus tag | Tcur_3331 |
Symbol | |
ID | 8604677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3848193 |
End bp | 3851171 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | translation initiation factor IF-2 |
Protein accession | YP_003300907 |
Protein GI | 269127537 |
COG category | |
COG ID | |
TIGRFAM ID | |
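The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the coding sequence ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3848193
end_bp = 3851171
protein_length_aa = 992

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
expected_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)  # 2979
print(expected_bp)     # 2979
```

Under these assumptions both figures agree with the listed gene length of 2979 bp.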
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.285011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGAAGG TCCGGGTTTA CGAGCTCGCC AAGGAGTTCG GAGTAGAGAG CAAGGTCGTC ATGGCCAAGC TCCAGGAGAT GGGCGAGTTC GTCCGGTCGG CGTCCTCCAC GATCGAGGCG CCGGTCGTTC GAAGGCTTAC CGAAGCTTTC TCCAATTCTT CTCAGGGTTC TTCTCAGGGG TCCGGCAAGG GCGGCGGGGG CGGTCGCAAG CCCCAGGCCT CGCCGCGCCG GCCCGAACAG CAGGCACAGC AGGCTGCGCC CAAGCCGCAG CCGCCCGCGA GCGCCGAAGG CGGCGCTCGC CCCGCACCGC CCAAGCCGGG TCCCGTGCCC AAGCCGGGCC CGGTGCCCAA GCCCGGCCCC CGGCCCGGCC CCGCCGGCCG CGCCGCGCCG CGTCCGGGTC CGGTGCCGCA GGCCCCGGCG CAGCAGGCGC CGGGCGGCGG TGCCCGTCCC AAGCCGCAGG CGCCCGCGGC CCAGGCCGGC CCCAAGGCGC CGCCGGCCCC GGGCGGTCCG GCCGCCGGGC AGGGCGGCCC CGCGGTGCCC AAGCCGGGTC CGCGCGTGCC CAAGCCGGGG CCGCGCCCCG GCCCGCGTCC GGCCGCGCCG CGTCCGGGCA ACAACCCGTT CAGCTCCACC AACACCGGCA TGGGCACCAC GCCCAAGCCG GGTCCGCGTC CGGCTCCGCC GCCCGGTGAC CGGCAGGGCG CCGCGGGTCA GCGCGGTGCC GCCGCTCCCG GCGGTCCGCG TCCGGCCCCG CCGCGCCCGG TGCCCGGGCC GCCGCGTCCG GGTCCGCGTC CCGGCGGCGC CGCCGGCGGT CCCCGTCCGG GCGGTCCGCG GCCGAGCCCG ATGAACATGC CGTCGCGGCC CAGCGCGCTG GGACCGGGCG CCGTGCGCGG CCCCGGCCGT CCCGGTGGCG CCGGGCGTCC CGGCGTGGGC CGTCCCGCCG GCGGCGGTCG TCCCGGCGGC GGCTTCGGCG GTCCGCGTCC CGGTGGCGGC GGTCGTCCCG GCGGTGGTTT CGGCGGTGCG CGTCCCGGTG GCGGCGGCCG TGGCCGTGGC GGCGCGGCCG GTGCCTTCGG GCGTCCCGGC GGCCGTCCCA CGCGGGGCCG CAAGTCCAAG AAGCAGCGGC GCCAAGAGTT CGACAACATG CAGGCGCCGG CCATCGGCGG CGTGCAGGCT CCGCGCGGCG GCGGCAAGAC CATCCGGCTG CCGCAGGGCG CCTCGCTGAC CGACTTCGCC GAGAAGATCG GCGCCAACCC GGCGTCGCTG GTGCAGATCA TGCTGCACCT GGGCGAGATG GTCACCGCGA CCCAGTCGGT CAACGAGGAC ACCCTCAAGC TGCTCGGCGC CGAGCTGGAC TTCGACGTCC AGGTGGTCTC GCCCGAGGAC GAGGACCGCG AGCTGCTGGA GTCCTTCGAC ATCGAGTTCG GCGAGGACAT CGGTGACGAG GACCAGCTGG TGGTCCGCCC GCCGGTGGTG ACCGTGATGG GTCACGTCGA CCACGGTAAG ACCAAGCTGC TGGACGCCAT CCGCAACGCC AACGTGGCCT CCGGCGAGGC CGGCGGGATC ACCCAGCACA TCGGCGCCTA CCAGGTGCAG ACCGAGGTCG ACGGCGAAGA GCGCAAGATC ACCTTCATCG ACACCCCCGG TCACGAGGCC TTCACCGCCA TGCGTGCCCG CGGTGCCGAC ACCACCGACC TGGTGGTGCT GGTGGTCGCC GCCGACGACG GGGTCAAGCC GCAGACCACC GAGGCGATCG ACCACGCCAC CGCGGCCGGG GTGCCGATCG TGGTGGCGGT CAACAAGATC 
GACAAGCCGG AGGCGGACCC GCACCGGGTG CGCGCCCAGC TCACCGAGTA CGGCCTGGTC GCCGAGGAGT ACGGCGGGCA GACCCTGTTC GTGGACGTGT CGGCCAAGGA GGGCACCAAC CTCGACGAGC TGCTCGAGGC GATCATCCTG ACCGCCGACG CCGAGCTGGA CCTGAAGGCC AACCCCGACA TGCCCGCCCA GGGCGTGGCC ATCGAGGCGC ACCTGGACCG GGGCCGGGGC GCGGTGGCGA CCGTGCTGGT GCAGCGCGGC ACCCTGCGGG TCGGCGACTC GATCGTCTGC GGCGTGGCGC ACGGCCGCGT CCGGGCGATG CTGGACGAAA ACGGCAACAA CGTCGAAGAG GCGGGGCCGT CGCGTCCGGT GCAGGTGCTC GGCCTGACCG CGGTGCCCAG CGCCGGCGAC AGCTTCCTGG TGGTCGAGGA CGACCGGGTG GCCCGGCAGA TCGCCGACAA GCGCATCGCC CGCAAGCGCA ACGCCGAGCT GCTGCGCACC CGCCGTTCCC GGACGCTGGA GGAGCTGTTC TCCGACCTCA AGAAGGGCGA GCGGCAGGAA CTGCTGCTCA TCATCAAGGG CGATGTGTCC GGTTCGGTGG AGGCGCTGGA AGACTCGCTG ATGAAGATCG ACGTCGGCGA CGAGGTCGGT CTGCGGGTCA TCCGCCGCGG CGTCGGCGCC ATCACCCAAG ACGACGTCAA CCTGGCGGTG GCCTCCGACG GCGCGGTCAT CATCGGCTTC AACGTCCGGG CCGAGCGCAA CGCCCAGGAG GCGGCCGACC GCGAGGGCGT GGACATCCGC TACTACTCGG TCATCTACCA GGCGATCGAG GAAGTCGAGG CGGCCCTCAA GGGCATGCTC AAGCCCGAGT ACGAGGAGGT CCAGCTCGGC ACCGCGGAGA TCCGCGCGAT CTTCAAGGTG CCGCGGATCG GCAACGTGGC CGGGTGCATG GTGCTCACCG GCACCATCAA CCGGGGCGCC AAGGCCCGCC TGGTCCGCCA GGGGACCGTG GTCGCCGACA ACCTCTCCAT CGCCTCGCTG CGCCGCGAGA AGGACGATGT CAACGAGGTC CGCGAGGGCT TCGAGTGCGG CATCGGGCTC GGGTACCAGG ACATCAAGAT CGGTGATGTC ATCGAGTGCT TCGAGATGCG GGAGAAGCCG CGCGACTGA
|
Protein sequence | MAKVRVYELA KEFGVESKVV MAKLQEMGEF VRSASSTIEA PVVRRLTEAF SNSSQGSSQG SGKGGGGGRK PQASPRRPEQ QAQQAAPKPQ PPASAEGGAR PAPPKPGPVP KPGPVPKPGP RPGPAGRAAP RPGPVPQAPA QQAPGGGARP KPQAPAAQAG PKAPPAPGGP AAGQGGPAVP KPGPRVPKPG PRPGPRPAAP RPGNNPFSST NTGMGTTPKP GPRPAPPPGD RQGAAGQRGA AAPGGPRPAP PRPVPGPPRP GPRPGGAAGG PRPGGPRPSP MNMPSRPSAL GPGAVRGPGR PGGAGRPGVG RPAGGGRPGG GFGGPRPGGG GRPGGGFGGA RPGGGGRGRG GAAGAFGRPG GRPTRGRKSK KQRRQEFDNM QAPAIGGVQA PRGGGKTIRL PQGASLTDFA EKIGANPASL VQIMLHLGEM VTATQSVNED TLKLLGAELD FDVQVVSPED EDRELLESFD IEFGEDIGDE DQLVVRPPVV TVMGHVDHGK TKLLDAIRNA NVASGEAGGI TQHIGAYQVQ TEVDGEERKI TFIDTPGHEA FTAMRARGAD TTDLVVLVVA ADDGVKPQTT EAIDHATAAG VPIVVAVNKI DKPEADPHRV RAQLTEYGLV AEEYGGQTLF VDVSAKEGTN LDELLEAIIL TADAELDLKA NPDMPAQGVA IEAHLDRGRG AVATVLVQRG TLRVGDSIVC GVAHGRVRAM LDENGNNVEE AGPSRPVQVL GLTAVPSAGD SFLVVEDDRV ARQIADKRIA RKRNAELLRT RRSRTLEELF SDLKKGERQE LLLIIKGDVS GSVEALEDSL MKIDVGDEVG LRVIRRGVGA ITQDDVNLAV ASDGAVIIGF NVRAERNAQE AADREGVDIR YYSVIYQAIE EVEAALKGML KPEYEEVQLG TAEIRAIFKV PRIGNVAGCM VLTGTINRGA KARLVRQGTV VADNLSIASL RREKDDVNEV REGFECGIGL GYQDIKIGDV IECFEMREKP RD
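The relation between the two sequences can be checked on a short prefix. This is a minimal sketch, assuming the gene sequence is shown 5' to 3' in coding orientation and that the GTG start codon is read as methionine, as is usual for initiation under translation table 11. The helper `translate` is hypothetical and is not part of the source record.

```python
from itertools import product

# Amino acid assignments for translation table 11, with codons ordered over
# the bases T, C, A, G at each position (same assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; the first codon is read as M."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Assumption: the initiator codon (GTG here) is translated as M.
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
print(translate("GTGGCGAAGG TCCGGGTTTA CGAGCTCGCC"))  # MAKVRVYELA
```

The output matches the first ten residues of the listed protein sequence.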
|
| |