Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3072 |
Symbol | |
ID | 8754748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 3219231 |
End bp | 3222365 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | non-ribosomal peptide synthetase |
Protein accession | YP_003410053 |
Protein GI | 284991499 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.273972 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCTGG TTCCCGCCGG GTCGGCCGGC CCCACGCCGG CGGCCGACCC GGTGCTCACC GGCTACGCCG CGGTGCTCGC CGAGGTGCTG CAGGTGAACA CCGTCGACCC CGACGCGCAC GTGTTCGAGG ACCTGGGCGC GGACTCGATG GTCATGGCCC GCTTCTGCGC CCGCGTGCGC AAGCAGGCCG ACCTGCCGGC CGTCTCGATC AAGGACGTCT ACCAGCACCC CACGGTCCGC GCCCTCGCGA CCGCGCTCGC GCCGCCACCG CCCGCACCGG CCGCCGACCC GGTCGCGTCG GGCCTGGCGG AGGTGCTCGC CGAGGTGCTG CAGGTCGACT CCGTGGCACC CGACGCGCAC CTCTTCGACG ACCTGGGCGC GGACTCCATG GTGATGGCGC GCTTCTGCGC CCGCGTGCGC AAGCGCGACG ACCTGCCGGC CGTCTCGATC AAGGACGTCT ACCAGCACCC GACGATCAGC AGCCTGGCGG CCGCCCTCGC CCCGGCACCG GCGTCCAGCC TGGTCCAGGA CGGGCTGGCG GAGGTGCTGG CGGAGGTGCT GCAGGTCGAC ACGGTCGCCC CTGACAGCAA CTTCTTCGAG GACCTCGGTG CCGACTCCAT GGTCATGGCC CGCTTCTGCG CCCGCGTGCG CAAGCGCGAC GGCCTGCCCG CGGTCTCGAT CAAGGACGTC TACGCGAACC CGACGGTCCG CGGGCTCGCG TCGGCGTTCG CAGCCGATGA GGCGGGTCCG GCGGCGCCGG ACCCTGCCGC GGGGAACGCC CCCGTGTCGA CCGAGGTGGC GCACCGGGCC AGCAGCCGGG AGTACGCCCT GACCGGAGCA TTGCAGCTGC TGATCTTCAT CGGCTACACC TACGTCACTG GCCTGGTCTT CCTGTGGAGC TTCGACTGGA TCACCGCCGG CGCATCCCTG GTCGACGACT ACCTGCGGTC GGTGGTGGCC GGTGGCACGA CCTTCGTGGT CATCTGCACC TTGCCGATCC TGGTCAAGTG GACGCTCATC GGCCGGTGGA AGCCCCGCCA GATCCCCCTG TGGAGCCCCG GCTACGTGCG CTTCTGGTTC GTCAGGACCA TGGTCATGGC CAATCCGCTG GTCCTGTTCG CCGGGTCACC GGTCTACAAC CTCTACCTCC GGGCGCTGGG CGCGAAGGTC GGCCGCGGTG CCGTGGTCTT CACCCGGCAC GCCCCCGTGT GCAGTGACCT GATCTCCATC GGCGCCGGCG CGGTCGTCCG CAAGGACAGC TACCTCAACG GCTACCGGGC GTACGCCGGG TGGATCCAGA TAGGCCCGGT CACCATCGGG GAGAACGCCT TCGTCGGCGA CAGGGCCGTG CTGGACATCG GCAGCTCGGT CGGCGACGGG GCACAGCTGG GCCACGCCTC CGCGCTCCAC GCCGGGCAGT CCGTCCCCGC TGGCGAGCGC TGGCACGGGT CCCCGGCGCA GCCCACCACG TCGGACTACC AGCGGATCGA ACCGGCCGAC TGCAGCACTT GGCGACGAGC CGCCTACGGC GCGGCGCAGC TGCTGACCGC CCTGCTCGTG TACCTGCCGG TGATGTTCGG CGGTGTGGTC ACGGTGATCG GCACGGTGCC GCAACTGGCC GCGCTGCTCG AGCCGGGGCC GGCGGCCCTG AGGTCAGCGA CGTTCTGGGA GTTCGCGCTG GTCCTGTCCC TGGTCGTCTT CTTCGGCGGC ATCCTCGTCC GCTTCCTGTT CGTCGTGGCC GCCTCCTGGG TGCTCGCACC CTTCCTCAAG CCGGACAAGG TCTATCCCCT GTACGGCTTC GCCTACTCGG TCCACCGGGC GATGCAGCAC TACACCAACA GCAAGTTCTT CATCACGCTC ACCGGTGACA CGTCCTACAT CGTCGGTTAC CTGCGCAGCA TCGGGTACAA GCTGGCCCCC GTCGTGCAGA CCGGGTCGAA CTTCGGCATG GAGGTCAAGC ACGAGGTCCC GCAGCTGACC TCGGTCGGCC GGGGCACCGT CGTCGCCAGT GGCCTGTCGA CACTCAACGC CACCTACTCG AGCACGTCGT TCTCGGTGAC CCGGGCCTCC ATCGGGGCGA ACAGCTTCCT CGGCAACGAC ATCATCTACC CCGCCGGGGC CAGGACCGGC GACGACTGCT TGCTCGCGAG CAAGGTGCTG GTCCCGATCG ACGGGCCCGT CCGGGAGGGG GTCGGCCTCC TCGGGGCACC CGCCTTCGAG ATCCCCCGCA CGGTCGAGCG GGACAGCAAG TTCATGCGGA TGGCCCACGA CGAGGACTTC CCGCGCCGCC TGGCCGCCAA GAACCGGTAC AACCTGGGCA CGATCGGGCT GTTCCTGCTC GCACGCTGGG TGTACTCCTT CGTCCTCACC GTGGTCTCCA TGACCGCCCT CGACGGCCTT GCGGAGCTCG GTGCGTGGGC GATCGCCCTG TCCACGCTCG GCCTGCTGCT CTTCAGCGTC GTCTACTTCT GCAGCCTCGA GCGCACCGCC ACCCGGTTCC TCGGCACAGG ACCCCTGTAC TGCTCCATCT ACGAGATCGA CTTCTGGCAG CGGGAACGCT TCTTCAAGTT CAACGCGCGG ATCGGTGTGC ACCGGCTCGC CGCCGGTACC CCGTTCGCGG CCCTGCTCTG GCGGATCGTG GGCCTCCGGC TCGGCAAGCG GCTGTTCGAC GACGGACACG TCATGGCCGA GAAGACGCTC GTGACGATCG GCGACGACGT CACCCTCAAC GCCGGCTCGT ACATCCAGGT CCATACGCAG GAGGACTACG CCTTCAAGTC CGACGCGACC ACTGTCGGCT CCGGCGTCAC CTTCGGGGTC GGCGCCATGG CGCACTACGG CGTGGTCATC GGGGACGACG TCGTCGTGGA GGCCGACTCC TTCGTCATGA AGGGCGAGGA GGTCCCGTCC GGTGCTCGGT GGGGCGGGAA CCCGGCCGTC GAACTGGCCG GGCCTCCCCC GCCCCTCCCC GCGCCGGCCC CGCGCGAAGA CGACCAGCTC ACCACCCAGT CCCAGGAGGA CGTCATGGAC CTGTTCCGGG GGAATGGCAC GTCCAGCCCG TCCCCGGCCA TCCCGATCCC GCGCAGGAGT GGCCGGCACC GCGCCACCCA GCGCCACCTG GCGGGGTCCC GGTGA
|
Protein sequence | MDLVPAGSAG PTPAADPVLT GYAAVLAEVL QVNTVDPDAH VFEDLGADSM VMARFCARVR KQADLPAVSI KDVYQHPTVR ALATALAPPP PAPAADPVAS GLAEVLAEVL QVDSVAPDAH LFDDLGADSM VMARFCARVR KRDDLPAVSI KDVYQHPTIS SLAAALAPAP ASSLVQDGLA EVLAEVLQVD TVAPDSNFFE DLGADSMVMA RFCARVRKRD GLPAVSIKDV YANPTVRGLA SAFAADEAGP AAPDPAAGNA PVSTEVAHRA SSREYALTGA LQLLIFIGYT YVTGLVFLWS FDWITAGASL VDDYLRSVVA GGTTFVVICT LPILVKWTLI GRWKPRQIPL WSPGYVRFWF VRTMVMANPL VLFAGSPVYN LYLRALGAKV GRGAVVFTRH APVCSDLISI GAGAVVRKDS YLNGYRAYAG WIQIGPVTIG ENAFVGDRAV LDIGSSVGDG AQLGHASALH AGQSVPAGER WHGSPAQPTT SDYQRIEPAD CSTWRRAAYG AAQLLTALLV YLPVMFGGVV TVIGTVPQLA ALLEPGPAAL RSATFWEFAL VLSLVVFFGG ILVRFLFVVA ASWVLAPFLK PDKVYPLYGF AYSVHRAMQH YTNSKFFITL TGDTSYIVGY LRSIGYKLAP VVQTGSNFGM EVKHEVPQLT SVGRGTVVAS GLSTLNATYS STSFSVTRAS IGANSFLGND IIYPAGARTG DDCLLASKVL VPIDGPVREG VGLLGAPAFE IPRTVERDSK FMRMAHDEDF PRRLAAKNRY NLGTIGLFLL ARWVYSFVLT VVSMTALDGL AELGAWAIAL STLGLLLFSV VYFCSLERTA TRFLGTGPLY CSIYEIDFWQ RERFFKFNAR IGVHRLAAGT PFAALLWRIV GLRLGKRLFD DGHVMAEKTL VTIGDDVTLN AGSYIQVHTQ EDYAFKSDAT TVGSGVTFGV GAMAHYGVVI GDDVVVEADS FVMKGEEVPS GARWGGNPAV ELAGPPPPLP APAPREDDQL TTQSQEDVMD LFRGNGTSSP SPAIPIPRRS GRHRATQRHL AGSR
|
| |