Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_3091 |
Symbol | |
ID | 8604435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 3578945 |
End bp | 3580582 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | Leucyl aminopeptidase |
Protein accession | YP_003300671 |
Protein GI | 269127301 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00268238 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCCGG CCCGCTCCGC AACGGCGAAC TCCTCGGCCG GGACCGGTCA CGGCCCGGCC GGCCGGTCGA GGAACTCCAA GGCGAGTCGA TCAACAGAAC GAAAGCGGTC GGCCAGGGCA CTAGCATCAG ATTGCGTGAC GACCATCAGT CTTACCAGCA CAGCCCCGTC CGCCCTGGAA GTCGACGCGA TCGTCATCGG GATCGCGCCC GGCGGCTCCG ACTCGTCGGA CGGCGCCCCG CCCCGGCTGG CCGACGGCGC CCACGACATC GACCGGGCCT TCGGCGGCAG GCTGGCGACG GCGCTGCAGG CGCTGGGCGC CACCGGCAAG GCCGGTGAGA TCACCAAGCT GCCCACGCTG GGCACGCTGC CCGCGCCCGT CCTGGTGGCG GCCGGGCTCG GGGAGGAGAC CGGCGCCGAT GCGCTGCGGC GCGCCTCGGG CGCGGCGGTG CGCGCGCTGG CCGGCTCGGC CAAGGTGGCC CTCGCACTGC CGGCGGGCAC GGCCGAGGAG GTCGGCGCGG TGGCGCTGGG CGCGTTGCTG GGCAACTACT CCTTCGGCAA GTACCGCACC GGCGAGCACA AAAAGCCGGT CGCGGAGCTG ACCGTGGCCA CCGGCGGCCC GGTCGAGGAC GGCGAGCAGG CCCTGGAGCG GGCGCGGACG CTGGCCTCCT CGGTGACGCT GGTGCGCGAC CTGGTCAACA CCCCGCCCTC GGACCTGGGC CCGGACGATC TGGCCCAGAT CGCCGCGAAG GTGGCCGGCG AGGTCGGCCT GGGGGTGGAG ATCCTCGATG AAAAGGCGCT GGCGGAGGGC GGCTACGGCG GCATCGTGGG CGTCGGGCAG GGCTCGGCCC GCCCGCCCCG GCTGGTGCGG CTGGCCTACT CCCACCCCGA GGCCGCCAAG ACGGTGGTGT TCGTGGGCAA GGGGATCACC TTCGACACCG GCGGCCTGTC GCTCAAGCCG TCCGAGGCCA TGGACTGGAT GAAGTCCGAC ATGGGCGGGG CGGGCGCGGT GCTGGGCGCG CTGCGCGCCA TCGCGCTGCT CAAGCCCAAG GTCAACGTGA TCGGCTACCT GCCGCTGGCG GAGAACATGC CCAGCGGCAC CGCCCAGCGG CCCTCCGACG TGCTCACCGT CTACGGCGGC AAGACCGTGG AGGTGCTCAA CACCGACGCC GAGGGCCGGC TGGTCATGGC CGACGCGCTG GTGCGCTCCG GCGAGGACTC CCCCGACCTG CTGGTGGACG TCGCCACGCT GACCGGTGCG CAGCTGGTGG CGCTGGGCAC CCGCACCTGC GGGGTGATGG CCAACGACGA CGAGGTCCGC GAGAAGGTGG TGGCCGCCGC CACCCGCGCC GGCGAGGCCG CCTGGCCCAT GCCGCTGCCG GCGGAGCTGC GCAAGGGGCT GGAGTCGGCG GTGGCCGACA TCGCCAACAT CAGCGGCGAG CGCTGGGGCG GCATGCTGGT GGCGGGCATC TTCCTCAAGG AGTTCGTCCC CGAGGGCGTC AAGTGGGCGC ACCTGGACAT CGCCGGCCCG GCCTTCAACA AGGGCGAGCC CTACGGCGAG GTCCCCAAGG GCGGGACCGG CGCCGCCACC CGCACCCTGG TGCAGATCGC CGAGGAGGTC GCCGCGGGAA CGCTGTGA
|
Protein sequence | MGPARSATAN SSAGTGHGPA GRSRNSKASR STERKRSARA LASDCVTTIS LTSTAPSALE VDAIVIGIAP GGSDSSDGAP PRLADGAHDI DRAFGGRLAT ALQALGATGK AGEITKLPTL GTLPAPVLVA AGLGEETGAD ALRRASGAAV RALAGSAKVA LALPAGTAEE VGAVALGALL GNYSFGKYRT GEHKKPVAEL TVATGGPVED GEQALERART LASSVTLVRD LVNTPPSDLG PDDLAQIAAK VAGEVGLGVE ILDEKALAEG GYGGIVGVGQ GSARPPRLVR LAYSHPEAAK TVVFVGKGIT FDTGGLSLKP SEAMDWMKSD MGGAGAVLGA LRAIALLKPK VNVIGYLPLA ENMPSGTAQR PSDVLTVYGG KTVEVLNTDA EGRLVMADAL VRSGEDSPDL LVDVATLTGA QLVALGTRTC GVMANDDEVR EKVVAAATRA GEAAWPMPLP AELRKGLESA VADIANISGE RWGGMLVAGI FLKEFVPEGV KWAHLDIAGP AFNKGEPYGE VPKGGTGAAT RTLVQIAEEV AAGTL
|
| |