Gene Gobs_1978 details


Gene Information

Locus tag: Gobs_1978
Symbol:
ID: 8753649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 2051466
End bp: 2053622
Gene Length: 2157 bp
Protein Length: 718 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003409044
Protein GI: 284990490
COG category:
COG ID:
TIGRFAM ID:
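
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon (the gene sequence listed further down ends in TAG). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2051466
END_BP = 2053622
GENE_LENGTH_BP = 2157
PROTEIN_LENGTH_AA = 718


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons agree with the listed Gene Length and Protein Length.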


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.440247
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCGC TCGATCGCAT CCTCGACGCC CTCGATGACC GCGACCGCGG ACCCCGGCAG 
TCCGGCGGGA GCTGGTTGGC CCGCTGCCCC GCCCATGACG ACCGGAAGCC GTCGCTGAGC
GTGCGGCAGG TCGAAGGCCA CGCCTTGATC TTCTGCCAGG CCGGGTGCGC CGCCGCCGAA
GTGATGACCG CGCTCGACCT CACCTTGCGC GACCTGTACG ACGACCCCCG CGGCGCCACC
TACACCTACC CCGACGGCCG CATCGTTCAC CGTAGCCCCG ACAAGCGCTT CTACCAGTCG
GGCAACACCA ACGGCACCGC CCTGTACCGG CTCGACAAGG TCACCGCCGC CGTCGCCGCT
GAGCAGACCA TCTACATCTG CGAGGGTGAG AAGGACGTAC ATGCCCTCGA GGCCATCGGC
CTGACCGCCA CGACGTCCCC GATGGGCGCC GGCAAGTGGG CCAAGATCGA CCCCACCCCG
CTGGCCGGGG GCACCGTCGT TATCGTCGCC GACGACGACG ACCCCGGCCG GCGGCACGCC
GCCGAGGTCC GCGACTCCCT CATCGGGCTC GGCGCCACCG TCAACACCGT CTCCGCCAAG
GCCGGCAAGG ACGCCGCCGA CCACGTCGCC GCCGGCTACG ACGCCGCCGA CTTCGTACCG
CTGCAGCTGC TGGTCGACGA CTTCGCCGAA CCTCCGCTGC CGCTGACCCC CACCTCCTGT
GCGACACGCT TTCCCACCGA GGTGCTCCCC GGCTGGGCCC GGGCCATGGT GGAGGCCGAA
GCCGAGGCCA CCCAGACCGA CGCGGGGATG GCCGCCAGCG TCCTGCTCGG GGCGCTCGCC
GCTGCCGCCG GCGGACACGC CCGGGTCTGG ATCCGCGCCG GCTGGTCAGA GCCGGTGAAC
ATCTTCACCG TCTCCGTCGC CGAGCCCGGC TCCCGCAAGA CCGCCGTGTT CGGTGCCATG
ACCGATCCAC TCGGCGACGC CGAGAAGGAA CTGGTTGCCA GCGACGCCGC GGCCCGCTGG
GAGACCGACG TGGCGCTCAA GGTCGCCCTG CAGGCCGCGG AGAAGCTGCA GCGGGATGCG
GCCACCGCCG CCGCCGCCAA GGACAAGGCG GCCGACGAGA AGCTTGCCGA CGCGATGAAC
GCGCGGGCCG CCGCCGAGGC CATCACCGTT CCGATCGACC CCCGACTCCT GGCCGACGAC
GTCACCCCGG AGGCGTTGGT GAGCCTGCTC GCCGACAACG GAGGCCGGAT CGCGGTGCTC
AGCGCCGAGG GTGGGATCTT CGACGTGTTG GCCGGCCGGT ACAGCAAGAC CCCCAACCTG
GACCCGATCC TCAAGGGTCA CGCCGGGGAC CGAATCCGGG TGGACCGCAA GGGGCGATCC
TCGGAGTACA TCGACCATCC CGCGCTGACC ATGTGCCTGA CGGTGCAGCC ACGCGTCATC
GAGGAGATCG GCCGCAACGG CGTCTTCGTC GGTCGCGGCC TGCTGGCCCG GTTCCTCTTC
TCGATTCCGC CCAACCGGGT CGGCTACCGC AAGGTCGGTG CGGCACCGGT CCCGCCGGAC
GTTGCCGCCA AGTACGCGGC GCGGATTCAG GCGCTGGTCG CTGCCCTGCA CGAGTGGGGC
GAGATGCCGA TGCTGCTGCA GTTGTCCGCC GACGCGGCGG AGGTGTTCCT GAACGCGGAG
CGCACCCTGG AGCCCCGGCT GGCCGGCGAC CTGCGGCCGG TGCAGGAGTG GGCGAGCAAG
CTGATGGGGG CCACCGCCCG GATCGGCGGT CTGTTGCACC TGGCGAGCTT CGACGCCGCC
GACACCGCCA TGCGTCGCCC GATCAGCGCC GAGACGATGA TCGGCGCTCT GAAGATCGCG
GCCTACTACA CCGACCACGC GATGGTCGCC TTCGGACTCA TGGGTGCAGA CCGAACCCTC
GGTGCCGCAC AGGAGCTCCT GACCCACGTC CACGCCCGCA AGATCGAGGA GTCGAGCATC
CGTGACCTGT TCACCGACCT GTCTCGCAGC CGGTTCCCGA GGACCGAGGA CGTGCTCGAC
GCGCTCGCCG TCCTGGTCTC CCACGGCTGG GCGGCACCGC TGCCGCCTCC GAAGACCTCC
GGGCCGGGCC GCAAACCGTC ACCCCGCTAC CGCTTCCGCC CAGCCCCCAT CGCGTAG
 
Protein sequence
MSALDRILDA LDDRDRGPRQ SGGSWLARCP AHDDRKPSLS VRQVEGHALI FCQAGCAAAE 
VMTALDLTLR DLYDDPRGAT YTYPDGRIVH RSPDKRFYQS GNTNGTALYR LDKVTAAVAA
EQTIYICEGE KDVHALEAIG LTATTSPMGA GKWAKIDPTP LAGGTVVIVA DDDDPGRRHA
AEVRDSLIGL GATVNTVSAK AGKDAADHVA AGYDAADFVP LQLLVDDFAE PPLPLTPTSC
ATRFPTEVLP GWARAMVEAE AEATQTDAGM AASVLLGALA AAAGGHARVW IRAGWSEPVN
IFTVSVAEPG SRKTAVFGAM TDPLGDAEKE LVASDAAARW ETDVALKVAL QAAEKLQRDA
ATAAAAKDKA ADEKLADAMN ARAAAEAITV PIDPRLLADD VTPEALVSLL ADNGGRIAVL
SAEGGIFDVL AGRYSKTPNL DPILKGHAGD RIRVDRKGRS SEYIDHPALT MCLTVQPRVI
EEIGRNGVFV GRGLLARFLF SIPPNRVGYR KVGAAPVPPD VAAKYAARIQ ALVAALHEWG
EMPMLLQLSA DAAEVFLNAE RTLEPRLAGD LRPVQEWASK LMGATARIGG LLHLASFDAA
DTAMRRPISA ETMIGALKIA AYYTDHAMVA FGLMGADRTL GAAQELLTHV HARKIEESSI
RDLFTDLSRS RFPRTEDVLD ALAVLVSHGW AAPLPPPKTS GPGRKPSPRY RFRPAPIA