Gene Information |
Locus tag | Tcur_3266 |
Symbol | |
ID | 8604612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3768208 |
End bp | 3770586 |
Gene Length | 2379 bp |
Protein Length | 792 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_003300843 |
Protein GI | 269127473 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.159842 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGTTGATGA CCGGACGGGA CGCCGGGGAA CGGCAGGGGC ATGACCTGCG GCTGGTGATC CCCGCGCTGG GGGTCTGGGG AACGGCGTGG GCCCTGCTGG GCGCCACGGC GGCGGTCGCC TATGGGGTGG CCGTGGGGTG CGGGCTGGGA GCACTGCCGT TTCTCGCCGG ATCCTTATCC CGCTCGAGGC GGAGGAGGGA CGCTCCGGAG AGCGGGGGAG CGGCCGGCGG GGGGTTGCGG CCTGCGTTCT GCGGGCTGGC CGGGGCGGTC CTGGCGTGTG CGGCCGCCGC GGCGGCCGGG ACGGGGCTGC GGCTGTCGGC GGTCGAGTCC GGGCCGGTGC GGGAGGCGGC CCGGGCCGGG ACGCGGGCGC CGATCGAGGC GGTGGTGACC GGTGATCCGG TGGTGAAGCC GTCCGAGCGG CGGCGGATCG TGCTGTTGCG GGCCCGGGCG GAGGCGATAC GCCTCCAAGG ACGCACCGTT CGGGTGCGGG TGCCCGTCCT GCTCATCGCG ACGGAGGACG CCTGGCGTGC TCTGGTACCC GGTCAGCGAA TCCAGGCCAC CGCACGGCTG TCCCCGCCGC GGCATGCCGA GCTCTTGGCG GCCGTCGCGA TCGTCCGGGG GCCGCCGGTC GTCGTGGCGG AGCCGTCGGC CGTGCGGCGC GCGGCCGAGA AGGTGCGGGC GAGCCTGCGC GCGGCCTGCG ATGTGCTCGC GCCGGACCAG CGCGGGGTGC TGCCCGGGAT GGTCGTCGGG GACACCTCGC GGCTCGACCC CGAGCTCGCC GCGGACTTCA CGGCCGCCGG GCTCACCCAT CTGATGGTGG TCTCCGGCGC CAACCTCGCC ATCGTCGCCG GCGCGGTCCT GACGCTGTGC CGCCTGGCCG GTCTGGGACG GCGGCGGGCT CCGGCCGTGG CGGTCGCGGC GCTGCCGGCG TTCGTCGTGG TCGCCCGCCC CGAACCCAGC GTGCTGCGGG CCACGGTGAT GACCCTGATC GGCCTGCTGG CCTTCGCCAC CGGCAGCCGG CGCCAGGGGC TGCCGGCGCT GGGCGGCGCC GTGCTCGTCC TGGTCCTGAT CGATCCGGGG CTGGCCCGCT CGTACGGGTT CGCCCTGTCG GTGCTGGCCA CCGGCGGCCT GTTGCTGCTG GCCCCGCCGT GGACGGACCG GCTGTCCCGG TGGCTGCCCC GGCCGGTGGC GGAGGCGCTG GCGGTCGCGG CGGCGGCCCA GCTGGCCTGC GCCCCGGTGC TGGTGATGCT GACCGGGCAG GTCGGCCTGG TGGCGGTGGC GGCCAACCTG CTGGCCGCTC CGGCGGTCCC GGTGGCCATG CTGCTGGGGG CGCTGGCCGC GGTGATCGCC CCGCTGTGGC TCACCCTCGC CCGGCTCGTG GTGTGGCCGG CGGGACTGGC GGTCGGCTGG ATCATCGGCG TGGCCCGGAC CGCGGCCGCG GTGCCGCACG CCACCGTCCC CTGGCAGGAC GGCTTCCTGG GCGCGGTCAC CCTCACGTTC GTGCTCGTCG CCGGCTGTCT GGTGCTCCGC AAACGCCTGC TGCTCATCGC CGTGGCGGCG GCAGGCACCG GAGCGCTCCT GGCCGGCGCC GCCGCACGGG TGACGGCGCC GGCCTGGCCG CCGCCCGGCT GGGCGATGGT CGCCTGTGAC GTGGGCCAGG GCGACGCGAG CGTGCTCGCG GTGGGAAAGG GAAGCGCCGT CGTCGTGGAC GCCGGGCCGG ACCCCGGCCT CGCCGACGCC TGCCTGGACC GCCTGAAAGT GCAGACCGTG CCGCTGCTCG TCCTCACTCA TCCCCATGCC 
GACCACATCG GGGGAACCCC CGGGATACGA CGCGGCCGCA CCGTCGGGAC GGTGGTCATC AGCGCCCGCA GCGACGGCAG GGAGTCCCGG CACACCTTCG GGCTGCCCCT GCACGCCGCC GTCTCCGGCC GGCAGTGGCA TGTCGGGGAC TTGTCGTTGA CCGTCCTCGC CCCCTGGGAG ACGGCCCGGC CCGATGCGCG TCCCGAAGCC GACGACGAGG CCGTCAACAA CGCCAGCATC GTCCTGGTCG CCCGCAAACC CGGCTTCAGC GCCCTGCTGA CCGGCGACAT CGAGACCGAG GCCCAGCGGG CTTTGATCCC CGACGTCCCG CCCGTGCAGG TGCTCAAGGT CCCCCATCAC GGCGCCCGTG ACCAGGATCC GGCCTTCCTG GCGGCCACGC GCGCCGCGAT CTCCATCATC TCCGCCGGAG AGAACAACGA CTACGGCCAC CCGGCGCCCA GCACCCTCGC CCTGCTGCAA CGCCTTGGCA CCCGTGTCTA CCGCACCGAC CGGCACGGCG ACATCGCCAT CGTCCCCACT CCCGCCGGGC CGGCCGCCGT ACCGAGGAAA GGCGGCTGA
Protein sequence | MLMTGRDAGE RQGHDLRLVI PALGVWGTAW ALLGATAAVA YGVAVGCGLG ALPFLAGSLS RSRRRRDAPE SGGAAGGGLR PAFCGLAGAV LACAAAAAAG TGLRLSAVES GPVREAARAG TRAPIEAVVT GDPVVKPSER RRIVLLRARA EAIRLQGRTV RVRVPVLLIA TEDAWRALVP GQRIQATARL SPPRHAELLA AVAIVRGPPV VVAEPSAVRR AAEKVRASLR AACDVLAPDQ RGVLPGMVVG DTSRLDPELA ADFTAAGLTH LMVVSGANLA IVAGAVLTLC RLAGLGRRRA PAVAVAALPA FVVVARPEPS VLRATVMTLI GLLAFATGSR RQGLPALGGA VLVLVLIDPG LARSYGFALS VLATGGLLLL APPWTDRLSR WLPRPVAEAL AVAAAAQLAC APVLVMLTGQ VGLVAVAANL LAAPAVPVAM LLGALAAVIA PLWLTLARLV VWPAGLAVGW IIGVARTAAA VPHATVPWQD GFLGAVTLTF VLVAGCLVLR KRLLLIAVAA AGTGALLAGA AARVTAPAWP PPGWAMVACD VGQGDASVLA VGKGSAVVVD AGPDPGLADA CLDRLKVQTV PLLVLTHPHA DHIGGTPGIR RGRTVGTVVI SARSDGRESR HTFGLPLHAA VSGRQWHVGD LSLTVLAPWE TARPDARPEA DDEAVNNASI VLVARKPGFS ALLTGDIETE AQRALIPDVP PVQVLKVPHH GARDQDPAFL AATRAAISII SAGENNDYGH PAPSTLALLQ RLGTRVYRTD RHGDIAIVPT PAGPAAVPRK GG
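
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only illustrative: it assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon (the gene sequence above ends in TGA). The record itself does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information table of this record.
start_bp = 3768208
end_bp = 3770586
gene_length_bp = 2379
protein_length_aa = 792


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == gene_length_bp)
print("protein length matches record:", computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold for the values listed here: the span covers 2379 bp, which corresponds to 793 codons, or 792 residues plus the stop codon.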