Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_4515 |
Symbol | |
ID | 8605875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 5117394 |
End bp | 5119781 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Vault protein inter-alpha-trypsin domain protein |
Protein accession | YP_003302080 |
Protein GI | 269128710 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTCC GCATCACACC GCTGCCCGAG CAGGCGGCAC CGCTGTCCGG CGCGGGCCTG GGCGCGCTGG CCACCGAGCG GGGCAACCTC CCCTTGGAGA CCGTCGACGT CCGCGCCGCC ATCACCGGGC TGAACGGACG CGTCGAGCTG AGCCAGGGCT TCCGCAACCC CTTCGACGTT CCGCTGGAGG CCGTCTACAT CTTCCCGCTG CCGGACCGCG CCGCCGTCAC CCGCATGCGG ATGGAGACCG CCGACGGCAC CGTGGAGGCC GCGCTCAAGG AACGCGGCCG GGCCAGGGCC GACTACCAGG AGGCCGTCGC CGCCGGGCAC CGCGCCGCGC TCGCCGAAGA GGACCGGCCG GATGTGTTCA CCATGCAGGT CGGCAACATC CCGCCCGGCG AGCGCGTCAC CGTCCGCCTG ACCCTCGACC AGCCGCTGCC GTATGAGGAC GGCGCCGCCA CCTTCCGTTT CCCGCTCGTC GTGGCGCCCC GCTACATCCC CGGCACCGCA CTGCCGGACG AGCGGGCCGG GGACGGGATC GCCGCCGACA CCGACGCGGT GCCGGACGCC TCCCGCATCA GCCCGCCCGT CCTGCTGCCC GGCTTTCCCG ATCCGGTGCG GCTGTCCCTG GAAGCGGACA TCGACCCCGC CGGGTTCCCG CTCGGGGAGA TCCGCTCCAG CCTGCACGTG GTGGCCGCCG ACACCCGCGG CTCCGGGCGC ACCACGGTGC GGCTGCAGCC CGGGGAACGG CTCGACCGCG ACTTCGTGCT GCGTCTGGCC TACGGCCGCC CGGAGCAGGC CGCCGCCTCC GTGACGCTGA CCCCCGACGC CGAAGGCGAA TCCGGGACGT TCACCCTCAC CGTGCTGCCG CCGTCCGAGC GGTGCGCGCC CCGGCCGCGC GATGTGGTCA TCCTGCTGGA CCGCTCGGGC AGCATGCACG GCTGGAAGAT GGTGGCGGCC CGCCGCGCCG CGGCCCGCAT CGTGGACACC CTGACCGGAC GGGACCGCTT CGCGGTGCTG TCGTTCGACG ACATGGTCGA GCGGCCCGCC GGGCTGGACG GCGGGCTCAG CCCGGCCACC GACCGCAACC GGTTCCGCGC CGTGGAGCAC CTGGCCGGGC TGCAGGCGCG CGGCGGCACC GAACTGGCGG CCCCGCTCCG GGAGGGCGCC GCCCTGCTGG ACGACGCGGG CCGCGACCGG GTGCTGGTGC TGATCACCGA CGGGCAGGTG GGGAACGAGG ACCAGCTCCT GGCCCTCATC GACCCCTTCC TGAACGGCCT GCGGATCCAC GCGGTCGGCA TCGACCAGGC GGTCAACGCC GGTTTCCTGG GACGGCTCGC CACCGCCGGG CAGGGGCGGC TGGAACTGGT GGAGTCCGAA GACCGGCTGG ATGAGGCGAT GGAGCACATC CACCACCGCA TCAACGCCCC GCTGCTGACC GGCCTGTCCC TGGAATGGTC CGGGGTGCGC GTCGAGCCCG GCACGCTCAC CCCGCCCCGG CCGGGCGCCG TCTTCCCCGG CGTCCCCCTG GTGGTGGCGG GCCGCTACCT GCGCGCCGCC GACGACTGCT CCCTGACCGT CCGCGCCGTG ACCGGCGAAG GCGCCCCCTG GCAGTGCCGG GCCGTCGTCT CCCGCACCGA GGCCGGCGCC GCCACCTCCG TCTGGGCCCG CGCCCACCTG CGGGATCTGG AGGACCGCTA CGCCGTCACC GGCGGGCGCG ACCTGGAGCA GCAGATCGTG GAGACCTCGC TGCGCTTCGG CGTGCTGTGC CGCTTCACCG CCTTCGTCGC CATCGACGAC CGGACCGTCG CCGGCGACCC CGCCCACCGG GTCGTGCAGC CCGTCGAGTC ACCCGCCGGC TGGCCGCCCC CGGCCCCGCC CGTGGCCGCG GCCTCCTCGC ACGCGATGCC CGCCATGCCC ATGGCCGCCC CTGCCGGCCC GCTCCCCCCG CCTCCCGCCG GTCCCGCTCC CCTCGCCACA CCGCCGAGGG GCCTGCCGCC CCGGCTGCGG CGCCCCCGCG GACCCGCCCC CAGCCCGTCC GTTGCGGCCC CGTCCGCGCC CCGCCCGTCG CCCGTCCCCG CCCTGGTCCG GCAGGAGCTG AACCGCCTGC AAGAGCTGGC CGCCGCCGCC CGCCCCGAGC GCCGCCGCCA CCTGTCCGAC CTGCGCACCA GGCTGCTCGG CCTCGCCCAG CACGCCCGCT CCGACGGCCT GCCCGCCGAC CGCCTGACCG CCCTGGCCGA CGCCCTGCAG GCCGCCGACG ACCCCGCCCA GCAGTCCCGC CTGGACGAGC TGTGGACCCT CGCCGTCGGG GAACTGACCG CCTTCCTCGA CGAGCACACC GCCACCCCGA AGCCACCCCG CCGCCCGTTC TGGAAGAAAC CCTCCTGA
|
Protein sequence | MTVRITPLPE QAAPLSGAGL GALATERGNL PLETVDVRAA ITGLNGRVEL SQGFRNPFDV PLEAVYIFPL PDRAAVTRMR METADGTVEA ALKERGRARA DYQEAVAAGH RAALAEEDRP DVFTMQVGNI PPGERVTVRL TLDQPLPYED GAATFRFPLV VAPRYIPGTA LPDERAGDGI AADTDAVPDA SRISPPVLLP GFPDPVRLSL EADIDPAGFP LGEIRSSLHV VAADTRGSGR TTVRLQPGER LDRDFVLRLA YGRPEQAAAS VTLTPDAEGE SGTFTLTVLP PSERCAPRPR DVVILLDRSG SMHGWKMVAA RRAAARIVDT LTGRDRFAVL SFDDMVERPA GLDGGLSPAT DRNRFRAVEH LAGLQARGGT ELAAPLREGA ALLDDAGRDR VLVLITDGQV GNEDQLLALI DPFLNGLRIH AVGIDQAVNA GFLGRLATAG QGRLELVESE DRLDEAMEHI HHRINAPLLT GLSLEWSGVR VEPGTLTPPR PGAVFPGVPL VVAGRYLRAA DDCSLTVRAV TGEGAPWQCR AVVSRTEAGA ATSVWARAHL RDLEDRYAVT GGRDLEQQIV ETSLRFGVLC RFTAFVAIDD RTVAGDPAHR VVQPVESPAG WPPPAPPVAA ASSHAMPAMP MAAPAGPLPP PPAGPAPLAT PPRGLPPRLR RPRGPAPSPS VAAPSAPRPS PVPALVRQEL NRLQELAAAA RPERRRHLSD LRTRLLGLAQ HARSDGLPAD RLTALADALQ AADDPAQQSR LDELWTLAVG ELTAFLDEHT ATPKPPRRPF WKKPS
|
| |