Gene Gobs_4886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4886 
Symbol 
ID8756588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp5099601 
End bp5101316 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content72% 
IMG OID 
Productalpha amylase catalytic region 
Protein accessionYP_003411789 
Protein GI284993234 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCCG ATCCCCCGTG GTGGACCCGC GCCGTCGTCT ACCAGGTCTA CCCGCGCTCG 
TTCCAGGACT CCGACGGCGA CGGGATCGGC GACCTCGGCG GGATCCTGCA GCGCGTCGAC
CACCTCGCCG ACCTGGGCGT CGACGTCGTC TGGCTCTCAC CGATCTACCC CTCGCCGCAG
GCCGACAACG GCTACGACAT CAGCGACTAC ACCGACGTCG ACCCGCTGTT CGGCTCGCTG
GCGCAGCTCG ACGAGCTGAT CACCGCCCTG CACGAGCGTG GGATGAAGCT GGTGATGGAC
CTGGTGGTCA ACCACACCAG CGACCAGCAC CCGTGGTTCG AGGAGAGCAG GGCGTCGCGG
ACGTCTCCGA GGCGCGACTG GTACTGGTGG CGGCCGCCGC GCGCGGGCAG AACGGCCGGG
CAGCCCGGCG CCGAGCCGAC CAACTGGCAC TCGTACTTCT CCGGGCCCAC CTGGGAGCTC
GACGAGGCCG GCGGCGAGTA CTACCTGCAC CTGTTCGCGC GCGAGCAGCC GGACCTGAAC
TGGGAGAACC CCGAGGTCCG CCAGGCGGTG TACGCGATGA TGCGGAACTG GCTCGACCGC
GGCGTCGACG GCTTCCGCAT GGACGTCATC AACATGATCA GCAAGGACGT CGCCCCGGAC
GGCTCGCTGC GCGACGGGCC CCCGCTGCCT GGCCCGCCCT ACGGCGACGG GACGGCGTCC
TTCCTCTGCG GGCCACGGAT CCACGAGTTC CTGCAGGAGA TGCACCGCGA GGTCTTCGCC
GGGCGGTCCG ACCGGCTGCT GACCGTGGGC GAGATGCCCG GCGTGACCGT GGAGCAGGCG
CGGCTGTTCA CCGACCCGGC GCGGGCCGAG GTGGACATGG TGTTCCAGTT CGAGCACGTC
GGCCTGGACT TCGACGAGTC CAAGTGGCGC CCGCGGCCGC TGCGGATGCG CGACCTCAAG
GCCTCCTTCG GCCGCTGGCA GACGGGGCTG GCCGACGTCG GCTGGAACTC CCTCTACTGG
GACAACCACG ACCAGCCGCG CGCCGTCTCG CGCTTCGGCG ATGACTCGCC CCGGTACCGC
CGCGACTCCG CGACCTGCCT GGCCACCCTG CTGCACCTGC ACCGCGGGAC GCCCTACGTC
TACCAGGGCG AGGAGCTGGG GATGGCCAAC GCCCCGTTCG ACAGCATCGA CGACTTCCGG
GACGTCGAGT CGCTCAACCA CTTCACGCAG GCGGTCGCGC ACGGCGAGGA CCCGGAGACG
GTGCTGGTCG TGCTGCGCCG GATGAGCCGG GACAACGCGC GCACGCCGGT GCAGTGGGAC
GCCTCGCCGT CCGCCGGCTT CACCACCGGC ACGCCGTGGA TCCCGGTCAA CCCCGACTCC
ACCGAGTGGA ACGCCGAGGC CCAGCGCGCC GACCCCACCT CGGTGTTCGC CCACCACAAG
CGGCTGATCG CGCTGCGGCA CGACGACCCG GTGGTCGCGC TCGGTGACTT CACCATGCTG
CTGCCCGAGC ACGACGAGCT GTACGCCTTC ACCCGCAGCC TGGACGGCGC GACGCTGCTC
GTCGTCTGCA ACCTCGGCGC CTCGACGCAC CCGCTGGGCG AGCTGCTGCC CGAGGCCGCC
GGGGCCGAGC TGATGCTCGG GAACCTGACC GACGAGGGTG ACCCCGCCGT CCTGCGGCCG
TGGGAGGCAC GGGTGCTGCG CCCGCGGAGC GCCTGA
 
Protein sequence
MPADPPWWTR AVVYQVYPRS FQDSDGDGIG DLGGILQRVD HLADLGVDVV WLSPIYPSPQ 
ADNGYDISDY TDVDPLFGSL AQLDELITAL HERGMKLVMD LVVNHTSDQH PWFEESRASR
TSPRRDWYWW RPPRAGRTAG QPGAEPTNWH SYFSGPTWEL DEAGGEYYLH LFAREQPDLN
WENPEVRQAV YAMMRNWLDR GVDGFRMDVI NMISKDVAPD GSLRDGPPLP GPPYGDGTAS
FLCGPRIHEF LQEMHREVFA GRSDRLLTVG EMPGVTVEQA RLFTDPARAE VDMVFQFEHV
GLDFDESKWR PRPLRMRDLK ASFGRWQTGL ADVGWNSLYW DNHDQPRAVS RFGDDSPRYR
RDSATCLATL LHLHRGTPYV YQGEELGMAN APFDSIDDFR DVESLNHFTQ AVAHGEDPET
VLVVLRRMSR DNARTPVQWD ASPSAGFTTG TPWIPVNPDS TEWNAEAQRA DPTSVFAHHK
RLIALRHDDP VVALGDFTML LPEHDELYAF TRSLDGATLL VVCNLGASTH PLGELLPEAA
GAELMLGNLT DEGDPAVLRP WEARVLRPRS A