Gene Gobs_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_1000 
Symbol 
ID8752661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp1059099 
End bp1060517 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content71% 
IMG OID 
Productfumarate lyase 
Protein accessionYP_003408132 
Protein GI284989578 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCTG AGACCCAGGC CGCACAGCAG GACGGCACCG ACTACCGCAT CGAGCACGAC 
TCCATGGGCG AGGTCCGGGT CCCCGCGTGG GCGAAGTGGC GGGCGCAGAC CCAGCGCGCC
GTCGAGAACT TCCCGATCTC CGGCACGCCG ATCGAGCGGG AGCTGATCGG CGCCCTTGCC
GCGATCAAGG GTGCCGCTGC CGCGGTGAAC GCCTCCCTGG GTGTGCTGCC GCAGGAAACC
GCCGACGCGA TCGGCACCGC GGCCGCGTCC GTGGCCCGCG GCGAGTGGGA CGAGCACTTC
CCCATCGACG TCTTCCAGAC GGGGTCGGGG ACGTCGAGCA ACATGAACAC CAACGAGGTG
ATCGCCTCCC TCGCCACCGA GGCGCTCGGC TCGCCGGTGC ACCCCAACGA CCACGTCAAC
GCCTCGCAGT CGTCCAACGA CGTCTTCCCG TCGGCCATCC ACGTGGCCGC GACGCGGGCG
ATCGTCCGCG ACCTGATCCC GGCGCTGCAG CACCTCGAGG CCTCGCTCTC CCGCAAGGCC
GAGGAGTTCG CCGAGGTCGT GAAGAGCGGC CGCACCCACC TGATGGACGC CACCCCGGTC
ACCCTCGGCC AGGAGTTCGG CGGGTACGCC GCGGCGGTGC GCTACGGCGT CGAGCGGCTG
CAGGCGTCGC TGCCGAGGAT CGGCGAGCTG CCGTTGGGCG GCACCGCGGT CGGCACGGGC
ATCAACACCC CGCCCGGGTT CGCCGCCGCG ATCATCGAGC GGCTCGCCGC CGAGCTGGAC
CTGCCGCTCT CCGAGGCGCG CGACCACTTC GAGGCGCAGA GCTCGCGCGA CGCCCTGGTG
GAGGGCTCCG GCCAGCTGAA GACGATCGCC GTCGGCCTGG TGAAGATCGC CAACGACCTG
CGCTGGATGG GTTCCGGCCC GCGCACCGGC CTGGGCGAGA TCCAGCTGCC CGACCTGCAG
CCCGGCAGCT CGATCATGCC GGGCAAGGTG AACCCGGTGA TCCCGGAGGC GGTCATCCAG
GTGAGCGCGC AGGTGATCGG GAACGACGCC GCGGTGACCT TCGCCGGCAC CACCGGCGTC
TTCGAGCTCA ACGTCACGCT GCCCCTGATG GCCCGCAACG TGCTCGAGTC CATCCGGCTG
CTGGCCAACG CCAGCCGGAT CCTGGCCGAC CGCTGCGTCG ACGGGATCGT CGCCAACGTC
GACCGGTGCC GGGAGTACGC CGAGTCCTCG CCGTCGATCG TGACGCCGCT GAACAAGTAC
ATCGGCTACG AGGAGGCGGC CAAGGTCGCC AAGCAGTCGC TGGCCGAGCA GAAGACGATC
CGCCAGGTCG TGGTGGAGCG GGGCTACGTC GAGCAGGGCA AGCTCACCGA GCAGCAGCTC
GACGAGGCCC TCGACGTCCT CTCGATGACC CACCCGTGA
 
Protein sequence
MPAETQAAQQ DGTDYRIEHD SMGEVRVPAW AKWRAQTQRA VENFPISGTP IERELIGALA 
AIKGAAAAVN ASLGVLPQET ADAIGTAAAS VARGEWDEHF PIDVFQTGSG TSSNMNTNEV
IASLATEALG SPVHPNDHVN ASQSSNDVFP SAIHVAATRA IVRDLIPALQ HLEASLSRKA
EEFAEVVKSG RTHLMDATPV TLGQEFGGYA AAVRYGVERL QASLPRIGEL PLGGTAVGTG
INTPPGFAAA IIERLAAELD LPLSEARDHF EAQSSRDALV EGSGQLKTIA VGLVKIANDL
RWMGSGPRTG LGEIQLPDLQ PGSSIMPGKV NPVIPEAVIQ VSAQVIGNDA AVTFAGTTGV
FELNVTLPLM ARNVLESIRL LANASRILAD RCVDGIVANV DRCREYAESS PSIVTPLNKY
IGYEEAAKVA KQSLAEQKTI RQVVVERGYV EQGKLTEQQL DEALDVLSMT HP