Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_4114 |
Symbol | |
ID | 8605470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 4695564 |
End bp | 4697213 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_003301680 |
Protein GI | 269128310 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGACG TAGGCGACTC CGATCCGATC GAGCGCTACT TCCGCCCCCG TCCGGAAAGC CCGACCGGCC CGGAGGAGGG CACCGTCCCC GAACGGCCCG GCGAGGGGGT CACGGTGGAC GGGGTGCCGG CGCCCAGGGT GACCGTGGAG GCCGCCGGCG GCGGCCCGCG CCGGCCCCGC CGCTCGATGA GCCACCGTTC GGCCGCCGGC GCCCGGCGGC AGCGCCGGTT CCTGATGACC GTGGGCGCGA TGTCGGCGCT CGTGCTGGTG ACCTCCGGGG GGGCCTGGGG GTTCCAGAAC TATGTGACCG GCGCCCTGGA CAAGGTGCGG GTCGGCGGGC TCGGCGACGG CGGCGGCCCC AAGGGGGCGA TGACGATCCT GGTGGCGGGC GTGGACCGCC GGGACGGGCT GACGCCCGAG CAGCAGAAGG CGGCCCGGCT GGGCCGGGCC GAGGGCGAGC GCTCCGACAC GATGATGCTG GTGCACATCT CGCGCGACCA CGACCGGATC TCGGTGGTGA GCCTGCCGCG CGACTCGCTG GTGATCATCC CGGCGCACCG CTCCAACGGC TCCGAGGGCC CCAAGGGGAC GCCGATCCCC GCCCGCTACG GCAAGCTGAC CTGGGCCTAC CAGTACGGCG GGCCGGATCT GACGGTGGCC ACCGTCAAAC GGGCCACCGG CGTGCCCATC GACCACTACA TCGAGGTCAA TTTCTACGGC TTCGTGAACA TGGTGGACGC CATCGGCGGC GTGGACGTGT GCGTGGAGAA GGCCGTTTAC GACAAAAAGA GCGGGCTGAA GCTGCCCGCC GGCACCACCC ACGTCAACGG GCTGCAGGCG CTGGCCTTCG CCCGGGCCCG CTACAGCATC GGCAACGGCA GCGACCTGGG CCGCATCGAG CGCCAGCAGC AGTTCATGGC CTCCCTGCTC AAGCAGGCGC TCAACACCAA GACCCTCAGC GATCCGGTCA AGTCCACCCG GTTCCTCAAT GCCGCGTTGA AGACGCTGCG GGTGGACGAG AAGCTGGCCA AGAACCTGCC CGCCCTGGCC GATCAGATGA AGGACCTGTC CACCGACAAC GTCACCTTCG TCAAGATCCC GCTCAAGAGC GAGAATTACA TGACGCCGAT CAACGGGTCG GCGCCCCAGT CCACCGTGCT GTGGGACCGG GAGAAGGCCG AAGAGGTGTT CGCCAAGATC CGGCGGGATC AGCCGTTCGA TCCTCCCTCG CCCAAGCCGG CCTCTCCCTC GCCCACCGTG GACCCCGACG CTCCCACCGT GCAGCCCCGT GACATCAACG TGGTGGTCCG CAACGGCGTG GGCACCCCCG GTCTGGCCGC CAGGGCCGCC GGCGACCTGC GCCGGGTGGG TTTCGGCACC TCGGTGCCGC CCGGGGTGGC CCGCACCGGC CTGCGCACCA CCCAGATCCA CTACGGGTCG GCCAATGTGG ACGCCGCCAA GACGCTGGCC GCCGCCATCC CCGGCGCCCG TCTCAAGAAG GTTCCGTCCC TCGGCGACAC CATCCAGGTC ATCGTGGGTT CCGACTGGAA GGGCGCCAAG AAGGTCAAGA TCGCCTCACT GCCCGGCGCG CCGTCCGATG AGGACACCGG CCCTCAGGTG AGCACGGCCT CGCAGAAGCT GTGCAAGTGA
|
Protein sequence | MPDVGDSDPI ERYFRPRPES PTGPEEGTVP ERPGEGVTVD GVPAPRVTVE AAGGGPRRPR RSMSHRSAAG ARRQRRFLMT VGAMSALVLV TSGGAWGFQN YVTGALDKVR VGGLGDGGGP KGAMTILVAG VDRRDGLTPE QQKAARLGRA EGERSDTMML VHISRDHDRI SVVSLPRDSL VIIPAHRSNG SEGPKGTPIP ARYGKLTWAY QYGGPDLTVA TVKRATGVPI DHYIEVNFYG FVNMVDAIGG VDVCVEKAVY DKKSGLKLPA GTTHVNGLQA LAFARARYSI GNGSDLGRIE RQQQFMASLL KQALNTKTLS DPVKSTRFLN AALKTLRVDE KLAKNLPALA DQMKDLSTDN VTFVKIPLKS ENYMTPINGS APQSTVLWDR EKAEEVFAKI RRDQPFDPPS PKPASPSPTV DPDAPTVQPR DINVVVRNGV GTPGLAARAA GDLRRVGFGT SVPPGVARTG LRTTQIHYGS ANVDAAKTLA AAIPGARLKK VPSLGDTIQV IVGSDWKGAK KVKIASLPGA PSDEDTGPQV STASQKLCK
|
| |