Gene Gobs_1930 details


Gene Information

Locus tag: Gobs_1930
Symbol:
ID: 8753601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 2003033
End bp: 2004163
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: alanine dehydrogenase
Protein accession: YP_003409004
Protein GI: 284990450
COG category:
COG ID:
TIGRFAM ID:
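
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length_bp = gene_length(2003033, 2004163)
print(length_bp)
print(protein_length(length_bp))
```

With the values in this record, the coordinate span comes to 1131 bp, and 1131 / 3 - 1 gives 376 aa, which agrees with the listed gene and protein lengths.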


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.275818
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACGC TCGTCGTCGG AGCACCGACC GAGATCAAGG ACAACGAGCG GCGGGTGGCA 
CTCACCCCCG ACGGTGTCGT GGAGCTGCTG CACGACGGTC ACCAGGTCGT CGTGCAGGCC
GGTGCCGGCG TCGGGTCCCG GTTCGCCGAC GACGAGTACG CGGCGGCCGG CGCCAAGGTC
GTGCCGACCG CCGAGGAGGT GTTCAACGCG GCCGACCTCA TCGTCAAGGT CAAGGAGCCG
GTGCCCGCGG AGTACGACCG CTTCCGCCGG GGCCAGCAGC TGTTCACCTA CCTGCACCTC
GCCGCCGACC GCGGGCTGAC CGAGTTCCTG CTGAAGCGGC GGATCGACTC CATCGCCTAC
GAGACCGTGC AGACCGCTGA TGGCAAGCTC CCGCTGCTGA CCCCCATGAG CGAGGTCGCG
GGCCGGATGG CCGTGCAGGC CGCCGCGCAC CACCTGGAGA ACCCGGCCGG TGGAGCGGGG
ATCCTGCTCG GCGGCGTCCC CGGCACCCCC GCGGCGAAGG TCCTCATCAT CGGCGGCGGG
GTGGCCGGCA CGGAGGCGGC GAAGATCGCG CTGGGGATGC GGGCCATCGT CCGGGTCCTC
GACACCAACC CGAGCCGACT GGCCTACCTG TCCGACATCT TCGGCGGGCG GCTGGACCTG
GTGACGCCCA ACCGCGCCCG GACGGCGGCC TACGTCGCCG AGGCCGACGT CGTGATCGGC
GCGGTCCTCG TGCCCGGCGC CAGGGCACCC AAGCTCGTCA GCAGGGACAT GATCGCCGCG
ATGCGCCCGG GCAGCGTGGT CGTCGACATC GCGATTGACC AGGGCGGCTG CTTCGAGACC
AGCCGGCCGA CCACCCACTC CGACCCCACC TACGTCGAGG AGGGCGTCGT CCACTACTGC
GTGGCCAACA TCCCCGGGGC GGTGTCCCGT ACCTCGACCC TGGCCCTGAC CTCGGCCACG
CTGCCGTACC TGGTCCGGGT CGCGCAGCAC GGCGTGGTCG GCGCGGCCCA GGCCGACCCC
GCCCTGCGTC TCGGGCTCAG CACGCTCGAC GGGCAGCTCG TCAACCAGCC GGTCGCCGAG
GCCCACGAGC TGCCCTTCAC CGACCCCGCC GAGCTCCTCG TCGCACGGTG A
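
The gene sequence above is displayed in blocks of ten bases separated by spaces. A minimal Python sketch for turning such a display into a plain string and computing simple statistics follows. It assumes that spaces and line breaks are display formatting only, and the helper names are hypothetical. The stop codon set is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First display line of the gene sequence, copied from this record.
first_line = "ATGAGCACGC TCGTCGTCGG AGCACCGACC GAGATCAAGG ACAACGAGCG GCGGGTGGCA "
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would allow a comparison with the GC content reported in the Gene Information fields, and `ends_with_stop` can confirm that the sequence terminates in a stop codon.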
 
Protein sequence
MSTLVVGAPT EIKDNERRVA LTPDGVVELL HDGHQVVVQA GAGVGSRFAD DEYAAAGAKV 
VPTAEEVFNA ADLIVKVKEP VPAEYDRFRR GQQLFTYLHL AADRGLTEFL LKRRIDSIAY
ETVQTADGKL PLLTPMSEVA GRMAVQAAAH HLENPAGGAG ILLGGVPGTP AAKVLIIGGG
VAGTEAAKIA LGMRAIVRVL DTNPSRLAYL SDIFGGRLDL VTPNRARTAA YVAEADVVIG
AVLVPGARAP KLVSRDMIAA MRPGSVVVDI AIDQGGCFET SRPTTHSDPT YVEEGVVHYC
VANIPGAVSR TSTLALTSAT LPYLVRVAQH GVVGAAQADP ALRLGLSTLD GQLVNQPVAE
AHELPFTDPA ELLVAR