Gene Information |
Locus tag | Gobs_4222 |
Symbol | |
ID | 8755916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 4432413 |
End bp | 4435370 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | protein of unknown function UPF0182 |
Protein accession | YP_003411155 |
Protein GI | 284992601 |
COG category | |
COG ID | |
TIGRFAM ID | |
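The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4432413
end_bp = 4435370
gene_length_bp = 2958
protein_length_aa = 985

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(computed_length, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)
```

Under these assumptions the computed values agree with the listed gene length and protein length.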
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGGCCATGC GGCCCCCTGT AGCCGTGCCG ACCCTGTCGC GGCGCGCGAA GGTGGTCATC GGGGCCATCG CCGTCCTGCT CGTGCTGTTC ACGGCCATCG GGACGCTCAC CAACGTCTAC GTCGACTACC TGTGGTTCGA CGAGACCGGG TTCACCGAGG TCTTCTGGAC CGAGCTGCAG ACCCGCGCCC TGCTGTTCGC CGTGGCCGGC GTGGCCACCG GTGGGCTCAC CGCGCTGGCG ATCCACCTGG CCTACCGGTT CCGCCCGACC TTCCGGCCGA TGTCGCTGGA GCAGCAGAAC CTCGAGCGCT ACCGGCAGTC GATCGAGCCG CGCCGCACCC TGGTGCTCAC CTCGGTCGCC GTCGTGCTGG GCCTGTTCGC GGGGTTCACC GCTCAGGGCA GCTGGGAGAC CTGGCTGCAG TTCCGCAACA GCACGGGCTT CGGCCGGGTG GACCCGGAGT TCGGCCTGGA CATCTCGTTC TTCGTCTTCG ACTACCCCTT CTACCGGCTG CTGCTGAGCT TCGGCTTCGC GATCGTGATC CTCGCGCTGA TCGGCTCGCT GCTGACCCAC TACGTGTTCG GTGGCCTGCG GCTGCAGACC CCGGGACAGA AGCTCACCGG CGCGGCCATG GTGCAGCTGT CGGTGCTGCT CGGGCTGTTC GTCGCGCTCA AGGCCGTCGC CTACTGGCTG GACCGCTACG CGCTGGTCTA CTCCGACCGA GGCGGCCTGT TCACCGGCGC CAGCTACACC GACGTCAACG CGCTGCTGCC GGCCAAGACG ATCCTGGTCT TCGCCGCGGC CGTCTGTGCG GTCGCGTTCC TCGCCAACGT CGTCGTCCGC AACTTCCGGC TGCCGGCCGC GGCGCTGGTG CTGCTGCTGA TCTCCAGCCT GGTGATCGGC GTGGCCTACC CGGCGATCGT GCAGCAGTTC GTCGTCCGGC CCAGCGCCAA CGAGCGCGAG GCCGACTTCA TCGCCCGCGC GATCGAGTCG ACCCGCCAGG CCTACGGCCT GGCCGACGTG GAGTACGTCG ACTACGCCCA GCAGGAGACC GGCGAGGAGG TCGACCCGGC CGCGGCGCTG GCCGAGCTGC GCAACGACAC CGAGACGATC CCCAACGCCC GGCTGCTGGA CCCCAACGTC CTGTCCGCCA CGTTCACCGC GCGCCAGCAG ATCCGCAACG TGTACGGCTT CCCCGAGAAG CTCGACATCG ACCGGTACAC GGTCAACGGC GAGACGCAGG ACTACGTCGT AGCGGTCCGC GAGCTCAACA GCCAGGGGCT CAGCGAGAAC CAGGACACCT GGATCAACCG GCACACCGTC TACACGCACG GCAACGGGTT CGTGGCCGCG CCGGCCAACC AGGTCGTCGC CGGCCAGGAG GGCGGCGAGC CGCGCTTCAC CACCCGGGAC CTGCCCACCC GCGGCAACAT CGAGGTCAGC GCCGACGGTG CGCGGATCTA CTACGGCGAG CTGATGCAGG ACTACTCGGT CGTCGGCGCC CCCGAGGGTG GTGAGCCGCG GGAGTTCGAC CTGCCCGAGG GCAGCGACGG CGAGGGGCAG ATCAACAACA CCTACGACGG CCGGGGTGGT GTCGAGGTCG GCAGCTTCTT CCGGCAGCTG ACCTTCGCGA TCTTCTACCG GGAGCGGAAC TTCCTGCTCT CCAGCGCCGT CAACGACGCC TCCAAGGTGC TCTACGTCCG CGACCCGATG GACCGGGTGG AGAAGGCCGC GCCGTTCCTC ACGGTGGACG GCGACCCGTA CCCCGCGGTC ATCGACGGCC GGGTGCAGTG GATCCTCGAC 
GGCTACACCA CCTCGGGCTC CTACCCCTAC GCCGAGCAGA TGGAGCTGGG CGAGGCGGCC ACCGACGCGC TGACCGGCAC CGGGACGACG GCGCTGCCGA ACGAGACGTT CAACTACATC CGCAACTCGG TGAAGGCCAC CGTCGACGCC TACGACGGCA CCGTCTCGCT CTACGAGTGG GACACCGAGG ACCCGGTCCT GCAGACCTAC ATGAAGGCCT TCCCCGGGCT GGTCCAGCCT CGTGAGGACA TGTCGCCGGA CCTGGTCGGC CACGTCCGCT ACCCGGAGGA CCTGTTCAAG GTCCAGCGGG ACATCCTGAC CCGCTACCAC GTCAGCGACC CGGGCGACTT CTACAGCGGC AACGACCGCT GGGCCGTCCC TGCCGACCCG ACGCAGGACA CCCAGGAGCC GCAGCCGCCG TACTACATCC TGGCCCAGCG GCCGGGCGAC CCGGAGGCGA GCTTCCAGCT GACCAGCGCG CTCAACGCCT TCCGCCGCGA GAACCTGTCG TCGTTCGTCT CGGCGTCCAG CGCGCCGGAC ACCTACGGGC AGATCCAGGT GCTGACCCTG CCGGGCAACA CGCCGTTCCG GGGCCCGCAG CAGGTGCAGC AGTCGTTCAT CACCAACAAC CAGGTGCGGC CGGACCTCAC GCTGTTCAAC AGTGCGGAGT CCCGGGCGGT GTTCGGCAAC CTGCTCACCC TGCCGATCGG CGACAACGGC CTGCTCTACG TCGAGCCGCT GTACGTCGAG GGCACGGGCG AGAACTCCTT CCCGCTGCTG CAGAAGGTGC TGGTCAACTA CGGCGACCGG GTCGGGTACG CCAACACCCT CGCCGAGGCG CTGGACCAGG TGTTCGGCGC CGGGGCGGGG GAGGCCGCCG TCGACAACGA CAACGCCCCC GCACCCACCG ACCAGCCCGA TGCGCCGGCG ACCCCGGCTC CGCCGGCCGA CGGCGGGACG GCGGACACCC CGAGCACCCC GGAGATGCAG TCGGCGGTCC AGGCCATCAA CAGCGCGCTG GCCGCGTTGG AGACGGCGCA GCGCAACGGC GACTTCGCCG GGCAGGGACA GGCCCTCGAG GACCTGCAGG CCGCCGTCAC CGCGTACCAG ACCGCGCAGG CCCAGGCCGC CCAGGCGGCC ACGACACCGG GGGGCTGA
Protein sequence | MAMRPPVAVP TLSRRAKVVI GAIAVLLVLF TAIGTLTNVY VDYLWFDETG FTEVFWTELQ TRALLFAVAG VATGGLTALA IHLAYRFRPT FRPMSLEQQN LERYRQSIEP RRTLVLTSVA VVLGLFAGFT AQGSWETWLQ FRNSTGFGRV DPEFGLDISF FVFDYPFYRL LLSFGFAIVI LALIGSLLTH YVFGGLRLQT PGQKLTGAAM VQLSVLLGLF VALKAVAYWL DRYALVYSDR GGLFTGASYT DVNALLPAKT ILVFAAAVCA VAFLANVVVR NFRLPAAALV LLLISSLVIG VAYPAIVQQF VVRPSANERE ADFIARAIES TRQAYGLADV EYVDYAQQET GEEVDPAAAL AELRNDTETI PNARLLDPNV LSATFTARQQ IRNVYGFPEK LDIDRYTVNG ETQDYVVAVR ELNSQGLSEN QDTWINRHTV YTHGNGFVAA PANQVVAGQE GGEPRFTTRD LPTRGNIEVS ADGARIYYGE LMQDYSVVGA PEGGEPREFD LPEGSDGEGQ INNTYDGRGG VEVGSFFRQL TFAIFYRERN FLLSSAVNDA SKVLYVRDPM DRVEKAAPFL TVDGDPYPAV IDGRVQWILD GYTTSGSYPY AEQMELGEAA TDALTGTGTT ALPNETFNYI RNSVKATVDA YDGTVSLYEW DTEDPVLQTY MKAFPGLVQP REDMSPDLVG HVRYPEDLFK VQRDILTRYH VSDPGDFYSG NDRWAVPADP TQDTQEPQPP YYILAQRPGD PEASFQLTSA LNAFRRENLS SFVSASSAPD TYGQIQVLTL PGNTPFRGPQ QVQQSFITNN QVRPDLTLFN SAESRAVFGN LLTLPIGDNG LLYVEPLYVE GTGENSFPLL QKVLVNYGDR VGYANTLAEA LDQVFGAGAG EAAVDNDNAP APTDQPDAPA TPAPPADGGT ADTPSTPEMQ SAVQAINSAL AALETAQRNG DFAGQGQALE DLQAAVTAYQ TAQAQAAQAA TTPGG
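The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares, and treats an initiator codon such as GTG at the first position as methionine. The function name `translate_cds` and the `START_CODONS` subset are hypothetical helpers for illustration, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of the initiator codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA fragment; spaces between blocks are ignored."""
    seq = dna.replace(" ", "").upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An initiator codon at the first position is read as methionine.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence listed above.
head = translate_cds("GTGGCCATGC GGCCCCCTGT AGCCGTGCCG")
tail = translate_cds("ACGACACCGG GGGGCTGA")
print(head)
print(tail)
```

The first fragment reproduces the opening residues of the protein sequence, and the second ends in `*`, the stop codon TGA that follows the final residues.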