Gene Gobs_4222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4222 
Symbol 
ID8755916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4432413 
End bp4435370 
Gene Length2958 bp 
Protein Length985 aa 
Translation table11 
GC content70% 
IMG OID 
Productprotein of unknown function UPF0182 
Protein accessionYP_003411155 
Protein GI284992601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCATGC GGCCCCCTGT AGCCGTGCCG ACCCTGTCGC GGCGCGCGAA GGTGGTCATC 
GGGGCCATCG CCGTCCTGCT CGTGCTGTTC ACGGCCATCG GGACGCTCAC CAACGTCTAC
GTCGACTACC TGTGGTTCGA CGAGACCGGG TTCACCGAGG TCTTCTGGAC CGAGCTGCAG
ACCCGCGCCC TGCTGTTCGC CGTGGCCGGC GTGGCCACCG GTGGGCTCAC CGCGCTGGCG
ATCCACCTGG CCTACCGGTT CCGCCCGACC TTCCGGCCGA TGTCGCTGGA GCAGCAGAAC
CTCGAGCGCT ACCGGCAGTC GATCGAGCCG CGCCGCACCC TGGTGCTCAC CTCGGTCGCC
GTCGTGCTGG GCCTGTTCGC GGGGTTCACC GCTCAGGGCA GCTGGGAGAC CTGGCTGCAG
TTCCGCAACA GCACGGGCTT CGGCCGGGTG GACCCGGAGT TCGGCCTGGA CATCTCGTTC
TTCGTCTTCG ACTACCCCTT CTACCGGCTG CTGCTGAGCT TCGGCTTCGC GATCGTGATC
CTCGCGCTGA TCGGCTCGCT GCTGACCCAC TACGTGTTCG GTGGCCTGCG GCTGCAGACC
CCGGGACAGA AGCTCACCGG CGCGGCCATG GTGCAGCTGT CGGTGCTGCT CGGGCTGTTC
GTCGCGCTCA AGGCCGTCGC CTACTGGCTG GACCGCTACG CGCTGGTCTA CTCCGACCGA
GGCGGCCTGT TCACCGGCGC CAGCTACACC GACGTCAACG CGCTGCTGCC GGCCAAGACG
ATCCTGGTCT TCGCCGCGGC CGTCTGTGCG GTCGCGTTCC TCGCCAACGT CGTCGTCCGC
AACTTCCGGC TGCCGGCCGC GGCGCTGGTG CTGCTGCTGA TCTCCAGCCT GGTGATCGGC
GTGGCCTACC CGGCGATCGT GCAGCAGTTC GTCGTCCGGC CCAGCGCCAA CGAGCGCGAG
GCCGACTTCA TCGCCCGCGC GATCGAGTCG ACCCGCCAGG CCTACGGCCT GGCCGACGTG
GAGTACGTCG ACTACGCCCA GCAGGAGACC GGCGAGGAGG TCGACCCGGC CGCGGCGCTG
GCCGAGCTGC GCAACGACAC CGAGACGATC CCCAACGCCC GGCTGCTGGA CCCCAACGTC
CTGTCCGCCA CGTTCACCGC GCGCCAGCAG ATCCGCAACG TGTACGGCTT CCCCGAGAAG
CTCGACATCG ACCGGTACAC GGTCAACGGC GAGACGCAGG ACTACGTCGT AGCGGTCCGC
GAGCTCAACA GCCAGGGGCT CAGCGAGAAC CAGGACACCT GGATCAACCG GCACACCGTC
TACACGCACG GCAACGGGTT CGTGGCCGCG CCGGCCAACC AGGTCGTCGC CGGCCAGGAG
GGCGGCGAGC CGCGCTTCAC CACCCGGGAC CTGCCCACCC GCGGCAACAT CGAGGTCAGC
GCCGACGGTG CGCGGATCTA CTACGGCGAG CTGATGCAGG ACTACTCGGT CGTCGGCGCC
CCCGAGGGTG GTGAGCCGCG GGAGTTCGAC CTGCCCGAGG GCAGCGACGG CGAGGGGCAG
ATCAACAACA CCTACGACGG CCGGGGTGGT GTCGAGGTCG GCAGCTTCTT CCGGCAGCTG
ACCTTCGCGA TCTTCTACCG GGAGCGGAAC TTCCTGCTCT CCAGCGCCGT CAACGACGCC
TCCAAGGTGC TCTACGTCCG CGACCCGATG GACCGGGTGG AGAAGGCCGC GCCGTTCCTC
ACGGTGGACG GCGACCCGTA CCCCGCGGTC ATCGACGGCC GGGTGCAGTG GATCCTCGAC
GGCTACACCA CCTCGGGCTC CTACCCCTAC GCCGAGCAGA TGGAGCTGGG CGAGGCGGCC
ACCGACGCGC TGACCGGCAC CGGGACGACG GCGCTGCCGA ACGAGACGTT CAACTACATC
CGCAACTCGG TGAAGGCCAC CGTCGACGCC TACGACGGCA CCGTCTCGCT CTACGAGTGG
GACACCGAGG ACCCGGTCCT GCAGACCTAC ATGAAGGCCT TCCCCGGGCT GGTCCAGCCT
CGTGAGGACA TGTCGCCGGA CCTGGTCGGC CACGTCCGCT ACCCGGAGGA CCTGTTCAAG
GTCCAGCGGG ACATCCTGAC CCGCTACCAC GTCAGCGACC CGGGCGACTT CTACAGCGGC
AACGACCGCT GGGCCGTCCC TGCCGACCCG ACGCAGGACA CCCAGGAGCC GCAGCCGCCG
TACTACATCC TGGCCCAGCG GCCGGGCGAC CCGGAGGCGA GCTTCCAGCT GACCAGCGCG
CTCAACGCCT TCCGCCGCGA GAACCTGTCG TCGTTCGTCT CGGCGTCCAG CGCGCCGGAC
ACCTACGGGC AGATCCAGGT GCTGACCCTG CCGGGCAACA CGCCGTTCCG GGGCCCGCAG
CAGGTGCAGC AGTCGTTCAT CACCAACAAC CAGGTGCGGC CGGACCTCAC GCTGTTCAAC
AGTGCGGAGT CCCGGGCGGT GTTCGGCAAC CTGCTCACCC TGCCGATCGG CGACAACGGC
CTGCTCTACG TCGAGCCGCT GTACGTCGAG GGCACGGGCG AGAACTCCTT CCCGCTGCTG
CAGAAGGTGC TGGTCAACTA CGGCGACCGG GTCGGGTACG CCAACACCCT CGCCGAGGCG
CTGGACCAGG TGTTCGGCGC CGGGGCGGGG GAGGCCGCCG TCGACAACGA CAACGCCCCC
GCACCCACCG ACCAGCCCGA TGCGCCGGCG ACCCCGGCTC CGCCGGCCGA CGGCGGGACG
GCGGACACCC CGAGCACCCC GGAGATGCAG TCGGCGGTCC AGGCCATCAA CAGCGCGCTG
GCCGCGTTGG AGACGGCGCA GCGCAACGGC GACTTCGCCG GGCAGGGACA GGCCCTCGAG
GACCTGCAGG CCGCCGTCAC CGCGTACCAG ACCGCGCAGG CCCAGGCCGC CCAGGCGGCC
ACGACACCGG GGGGCTGA
 
Protein sequence
MAMRPPVAVP TLSRRAKVVI GAIAVLLVLF TAIGTLTNVY VDYLWFDETG FTEVFWTELQ 
TRALLFAVAG VATGGLTALA IHLAYRFRPT FRPMSLEQQN LERYRQSIEP RRTLVLTSVA
VVLGLFAGFT AQGSWETWLQ FRNSTGFGRV DPEFGLDISF FVFDYPFYRL LLSFGFAIVI
LALIGSLLTH YVFGGLRLQT PGQKLTGAAM VQLSVLLGLF VALKAVAYWL DRYALVYSDR
GGLFTGASYT DVNALLPAKT ILVFAAAVCA VAFLANVVVR NFRLPAAALV LLLISSLVIG
VAYPAIVQQF VVRPSANERE ADFIARAIES TRQAYGLADV EYVDYAQQET GEEVDPAAAL
AELRNDTETI PNARLLDPNV LSATFTARQQ IRNVYGFPEK LDIDRYTVNG ETQDYVVAVR
ELNSQGLSEN QDTWINRHTV YTHGNGFVAA PANQVVAGQE GGEPRFTTRD LPTRGNIEVS
ADGARIYYGE LMQDYSVVGA PEGGEPREFD LPEGSDGEGQ INNTYDGRGG VEVGSFFRQL
TFAIFYRERN FLLSSAVNDA SKVLYVRDPM DRVEKAAPFL TVDGDPYPAV IDGRVQWILD
GYTTSGSYPY AEQMELGEAA TDALTGTGTT ALPNETFNYI RNSVKATVDA YDGTVSLYEW
DTEDPVLQTY MKAFPGLVQP REDMSPDLVG HVRYPEDLFK VQRDILTRYH VSDPGDFYSG
NDRWAVPADP TQDTQEPQPP YYILAQRPGD PEASFQLTSA LNAFRRENLS SFVSASSAPD
TYGQIQVLTL PGNTPFRGPQ QVQQSFITNN QVRPDLTLFN SAESRAVFGN LLTLPIGDNG
LLYVEPLYVE GTGENSFPLL QKVLVNYGDR VGYANTLAEA LDQVFGAGAG EAAVDNDNAP
APTDQPDAPA TPAPPADGGT ADTPSTPEMQ SAVQAINSAL AALETAQRNG DFAGQGQALE
DLQAAVTAYQ TAQAQAAQAA TTPGG