Gene Information |
Locus tag | Tcur_4801 |
Symbol | |
ID | 8606163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 5442236 |
End bp | 5443333 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | capsule biosynthesis protein, putative |
Protein accession | YP_003302356 |
Protein GI | 269128986 |
COG category | |
COG ID | |
TIGRFAM ID | |
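
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5442236
end_bp = 5443333

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the results agree with the listed Gene Length and Protein Length fields.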
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACGTC GGTCGCGGGT CGCGGCGGCG GGCGCGATGC TGTCGGCGAT GGCGCTGGTC AGCGGGTGCG GGCCGCTGTC ATCGGTGGTG GACGACGCCA ACGGCTACGG CGCGCCGGGC GCCGGCCCGG TCACCGTGGC CTTCGGCGGG GACGTGCACT TCGAAGGCCA GATCCGCGCC CGGCTCGACA GCGACCCGGC CACCGCCCTG GGGCCCATCG CCGAGACGCT CCGCGCCGCC GACGTGGCCA TGGTCAACCT GGAGACCGCC ATCACCACCG GCGGCACCCC GGCCCCCAAG CAGTTCGTCT TCCGCGCCCC GCCCACCGCG TTCACCGCCC TCAAGGCCGC CGGGGTGGAC GTGGCGACGA TGGCCAACAA CCACGGCATG GACTACGGCG AGACCGGTCT GCGCGACTCG CTGGCCGCCG CAAAGCAGGC CGGATTCCCG GTGGTGGGCA TCGGCAACAA CGCCGCCGAG GCCTACAAAC CGTGGGAGAC CGTGGTCCGC GGCACCCGGA TCGGCGTCAT CGGCGCCACC CAGGTGCTCG ACGACCACCT GATCGAGGCG TGGACGGCCA CCGACACCAA GCCCGGCCTG GCCTCCGCCA AGGACGCCCA GCGCCTGGTG CAGGAGGTGA AGGCGAACCG CCACCGCTAC GACGTGCTGA TCGTCAACGT GCACTGGGGC CGGGAGCTGG AGCAGTGCGC CACCGACGCG CAACGGCAAC TGGCCGACCG GCTCGTGACG GCCGGGGCGG ACGCGGTGGT CGGCGGCCAC GCCCACGTGC TGCAGGCCGG CGGCTTCCTC AAGGGCAAGT ACGTCCACTA CGGGCTGGGC AACTTCGTCT TCTACAACTC CGGCCCCGTC ACCGGGCAGA CCGGGGTGCT GACCCTCACC TTCGACCCGC CCGCCGGCCG CCCGCGCCTG CAGGGCGCCA AGGTCACCAA GGCGGTGTGG ACGCCGGCCT TCATCACCGG CGGCATCCCG CAGCCGCTGA GCGGGACCGA GGCCCAGCAG GCCATCGCCC GTTGGGAGGG GCTGCGGTCC TGCGCCGACG TCACCGCCTC CCCGTCCCCC GGCACCCCGG GGCCGTGA
|
Protein sequence | MGRRSRVAAA GAMLSAMALV SGCGPLSSVV DDANGYGAPG AGPVTVAFGG DVHFEGQIRA RLDSDPATAL GPIAETLRAA DVAMVNLETA ITTGGTPAPK QFVFRAPPTA FTALKAAGVD VATMANNHGM DYGETGLRDS LAAAKQAGFP VVGIGNNAAE AYKPWETVVR GTRIGVIGAT QVLDDHLIEA WTATDTKPGL ASAKDAQRLV QEVKANRHRY DVLIVNVHWG RELEQCATDA QRQLADRLVT AGADAVVGGH AHVLQAGGFL KGKYVHYGLG NFVFYNSGPV TGQTGVLTLT FDPPAGRPRL QGAKVTKAVW TPAFITGGIP QPLSGTEAQQ AIARWEGLRS CADVTASPSP GTPGP
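
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, checked for GC content, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database API. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code. The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in the coding orientation already, and no reverse complement is taken here.

```python
# Amino acid assignments for NCBI translation table 11 (same as the
# standard code), listed for codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
excerpt = "ATGGGACGTC GGTCGCGGGT CGCGGCGGCG"
print(translate(excerpt))
print(round(gc_fraction(excerpt), 2))
```

Passing the full gene sequence to the same functions would allow the listed protein sequence and GC content fields to be checked in the same way.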
|
| |