Gene Gobs_4798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4798 
Symbol 
ID8756499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp5008604 
End bp5010214 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content69% 
IMG OID 
Productmalate synthase A 
Protein accessionYP_003411706 
Protein GI284993151 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.412323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCA TCGCCGGCGT GGACGGTCTC GAGATCACCG GCCCCATGGG TGACCGGTTC 
GACGAGGTCC TCACCCCCCG GGCGCTCGAG CTCGTCGTCC TCCTCCACCG CGAGCTGAAC
GGGCGGCGGC TCGAGCGGCT CGCCGCGCGA CAGGAGCGGG TCCGCGCGCT GGCCGACGGC
GCCAGCCTCG ACTTCCTCGA GGAGACCCGG TCGATCCGCG AAGACGACTC GTGGCGGGTG
GCCGACCCGG CGCCCGGCCT GGTCGACCGC CGGGTGGAGA TGACCGGCCC GACCGACCGC
AAGATGACGA TCAACGCGCT GAACTCCGGG GCCAGGTGCT GGCTGGCCGA CCAGGAGGAC
GCGAACTCCC CGCTGTGGGA GAACGTCGTC AACGGCCCTC TCAACCTGAT GGACTCGCTC
GACCGCACGA TCGACTTCAC CAGCCCGCAG GGCAAGAAGT ACGAGCTCAG GCCGGACGAC
GAGCTCCCGA CCATCATCGT GCGCCCCCGC GGCTGGCACC TGCCGGAGAA GCACATCACC
GTCGACGGCG AGCAGACCTC CGGCAGCCTG GTCGACTTCG CGCTGTACCT GGCCGCCTGC
GGCCAGAAGC AGATCGACAA GGGCCGCGGC CCGTACTTCT ACCTGCCGAA GATGGAGAGC
CACCTCGAGG CGCGGCTGTG GAACGACGCC TTCAACCTGG CGCAGGACCA CCTCGGCATC
CCCCGCGGCA CCATCCGCGC CACCTGCCTG ATCGAGACCT ACCCCGCGGC GTTCGAGATG
GAGGAGATCC TCTACGAGCT GCGCGAGCAC TCCGCCGGAC TCAACGCCGG CCGCTGGGAC
TACATGTTCA GCGTGATCAA GACGTTCCGG ACCCGGGGCG AGGAGTTCAC CCTGCCGGAC
CGCAACTCGG TCGGCATGAC TGTGCCCTTC ATGCGGGCCT ACACCGAGCT GCTCGTGCGG
ACCTGCCACA AGCGCGGCGC GCACGCGATC GGCGGCATGT CCGCGTTCAT CCCGAGCAAG
GACCCGGAGG TCAACGAGTT CGCCTTCAAG AAGATCACCG AGGACAAGAC CCGCGAGGCC
AACGACGGCT TCGACGGCTC GTGGGTGGCC CACCCCGGCA TGGTGCAGGC GGCGATGGAC
GTCTTCGACA AGGTGCTGGG CGACAAGCCC AACCAGCTGG ACAACCTGCG GGAGGACGTC
CAGGTCACCG CCGCTCAGCT GCTCGACGTG AAGTCCACTC CCGGCGAGGC CACCGAGGCC
GGGCTGCGGG CCAACATCAG CGTGGGGATC CAGTACGTGG AGTCGTGGCT GCGCGGCTCC
GGCGCGGTCG GCATCAACAA CCTCATGGAG GACGCCGCCA CCGCGGAGAT CTCCCGCAGC
CAGGTCTGGC AGTGGCTGCA CAACGGCGTG CGGCTCTCGA ACGGCGAGCC GGTCACCCGC
GAGCTGGTCG AGCGCCTCAC CGACGAGGAG ATGCAGTCGA TCGCCGAGTC CCGCGGCGAC
TCCTTCGCCT CCGGCCGCTG GGACGACGCC CGCGCGCTGT TCCTGGAGAT GGCCGTGGCC
GACGAGTACT CGGACTTCCT GACCCTGCCC GCCTACGAGC GCATGCCGTG A
 
Protein sequence
MSTIAGVDGL EITGPMGDRF DEVLTPRALE LVVLLHRELN GRRLERLAAR QERVRALADG 
ASLDFLEETR SIREDDSWRV ADPAPGLVDR RVEMTGPTDR KMTINALNSG ARCWLADQED
ANSPLWENVV NGPLNLMDSL DRTIDFTSPQ GKKYELRPDD ELPTIIVRPR GWHLPEKHIT
VDGEQTSGSL VDFALYLAAC GQKQIDKGRG PYFYLPKMES HLEARLWNDA FNLAQDHLGI
PRGTIRATCL IETYPAAFEM EEILYELREH SAGLNAGRWD YMFSVIKTFR TRGEEFTLPD
RNSVGMTVPF MRAYTELLVR TCHKRGAHAI GGMSAFIPSK DPEVNEFAFK KITEDKTREA
NDGFDGSWVA HPGMVQAAMD VFDKVLGDKP NQLDNLREDV QVTAAQLLDV KSTPGEATEA
GLRANISVGI QYVESWLRGS GAVGINNLME DAATAEISRS QVWQWLHNGV RLSNGEPVTR
ELVERLTDEE MQSIAESRGD SFASGRWDDA RALFLEMAVA DEYSDFLTLP AYERMP