Gene Gobs_1708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_1708 
Symbol 
ID8753379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp1768825 
End bp1770309 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content73% 
IMG OID 
ProductBetaine-aldehyde dehydrogenase 
Protein accessionYP_003408793 
Protein GI284990239 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAACC TCCTCATCGG TGGCTCCTGG CGGCCGGGGG CCTCGTCGGA CCAGCGCCAG 
GTCGTCAACC CGTTCGATCA GAGCGTCGTC GCGAAGGTGG ACGAGGCGAC GGTCGAGGAT
GTCGCCGCGG CCGTCGGAGC GGCCCGGGCC GCCTTCGACG CGGGGGAGTG GTCCGGCACC
CCGGCGGCCG AGCGGGGTGC CCTGCTGCGC CGCGTCGCCG ATCTCCTGGT GCGCGACCGG
GAGGACATCG CCCGGCTGGA GACGCTGGAC ACCGGCAAGA CGGTCGGCGA GAGCCGGCAG
GACGTCGACG ACGTGGCCGC GGTCTTCCGC TACTACGCGG ACCTGGCGGA CAAGGACCCC
GGGCGTCTGG TCGACGCCGG GCGTCCCGGC GTCGTCAGCC GCGTCGTCCA CGAGCCGGTC
GGCGTGTGCT CGCTCATCAC GCCGTGGAAC TATCCGCTCC TGCAGGTGTC CTGGAAGGTC
GCGCCGGCGC TCGCGGCCGG GAACACCGTC GTCGTCAAGC CGAGTGAGGT CACCCCGCTG
ACCACCATCC GGCTCACCGA GCTGCTGGAG GAGGCCGGCG CCCCGGCGGG CGTGGTCAAC
CTCGTCCTCG GTGGCCGGGA CGTCGGGGCG GCGATGGTCG AGCACCCCGA CGTGGACATG
GTGTCGTTCA CCGGCGGCCT CAGCACGGGG CAGGAGATCC TGCGGGCCGC CGCCGAGACG
GTGAAGAAGG TGACCGTCGA GCTCGGTGGC AAGAACCCCA ACGTGGTCTT CGCCGACGTC
GACCTCGACA CCGCCGTCGA CAACGCGCTG ACGGCGGCCT TCCTGCACTC CGGCCAGGTG
TGCTCGGCGG GGACCCGGGT CATCGTGCAG GACGGGATCC ACGACGAGTT CGTCGCGGCC
CTGGTCGAGC GCGCCCGGCG CATCCGCCTC GGCAACGGGT TCGACCCGGC GACCGAGAGC
GGACCGCTGG TGTCCGAGGA GCACCGGGCG AGCGTGGAGG AGTACGTCGC GATCGGCGTC
CGCGAGGGCG CCCGGCTGGT CACCGGTGGC CGCCGTCCCG ACGAGCCGGA GCTGGCCGGC
GGGTTCTTCT ACCTGCCGAC GGTGTTCACC GACTGCCGGC GGGACATGCG GATCGTGCAG
GAGGAGTCCT TCGGTCCCGT CCTCACCGTG GAGCGGTTCG CCACGGAGGA GGAGGCGATC
GCGCTCGGCA ACGACACCGA CTACGGGCTC GCCGGTGCGG TCTGGACCGC CGACACCGCC
CGCGCCGAGC GGGTGGCCCG CGCCCTCCGC CACGGAACGG TCTGGATCAA CGACTTCGGC
CCGTACCTCC CGCAGGCGGA GTGGGGCGGG TTCGGACGAT CGGGCAACGG GCGCGAGCTC
GGCCTGGCAG GTCTCGCCGA GTACCGCGAG GCCAAGCACA TCTGGCACAA CACCCGTCCC
GCCCGGCAGG ACTGGTTCGG CCGGCCCGCG GGGAACGGGG CCTGA
 
Protein sequence
MANLLIGGSW RPGASSDQRQ VVNPFDQSVV AKVDEATVED VAAAVGAARA AFDAGEWSGT 
PAAERGALLR RVADLLVRDR EDIARLETLD TGKTVGESRQ DVDDVAAVFR YYADLADKDP
GRLVDAGRPG VVSRVVHEPV GVCSLITPWN YPLLQVSWKV APALAAGNTV VVKPSEVTPL
TTIRLTELLE EAGAPAGVVN LVLGGRDVGA AMVEHPDVDM VSFTGGLSTG QEILRAAAET
VKKVTVELGG KNPNVVFADV DLDTAVDNAL TAAFLHSGQV CSAGTRVIVQ DGIHDEFVAA
LVERARRIRL GNGFDPATES GPLVSEEHRA SVEEYVAIGV REGARLVTGG RRPDEPELAG
GFFYLPTVFT DCRRDMRIVQ EESFGPVLTV ERFATEEEAI ALGNDTDYGL AGAVWTADTA
RAERVARALR HGTVWINDFG PYLPQAEWGG FGRSGNGREL GLAGLAEYRE AKHIWHNTRP
ARQDWFGRPA GNGA