Gene Gobs_4555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4555 
Symbol 
ID8756253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4780347 
End bp4781678 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content70% 
IMG OID 
ProductNADH dehydrogenase I, D subunit 
Protein accessionYP_003411476 
Protein GI284992922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCA CCTACAACCC GGCCGACCCC TACGCCGGCT CCCGGGAGAC CACCGAGGGC 
CGCGTCTACA CCGTCACCGG CGGCGACTGG GACCAGACGC TGGGCACCGA GGCGTACGGC
GAGGAGCGGC TCGTCGTCAA CATGGGGCCG CAGCACCCCT CCACCCACGG CGTGCTGCGG
CTGGTGCTCG ACCTCGAGGG CGAGACGGTC ACCAAGGCCC GCGTGGTGAT CGGCTACCTG
CACACCGGGA TCGAGAAGAA CACCGAGTAC CGCAACTGGA CGCAGGGGAC GACGTTCGTC
ACGCGGATGG ACTACCTGTC CCCGCTCTAC AACGAGGCCG GCTACTGCAT GGCGGTCGAG
AAGCTGCTCG GCGTCGAGGC GCCGCAGCGG GCCCAGACCA TCCGCGTGCT GGTCATGGAG
CTCAACCGGA TCGCCTCGCA CCTGGTCGCG CTGGCCACCT TCGGCATGGA GATGGGCGCG
CTCACCGGGA TGACCAACGG CTTCCGCGAG CGGGAGCTCG TCCTGGACCT GCTCGAGGAG
ATCACCGGGC TGCGGATGAA CCACGCCTAC ATCCGCCCCG GCGGGCTGGC GCAGGACCTC
CCGCCCGGCG CGGTCGAGCA CATCCGGGAG TTCCTGCAGG TCATGCCGGA CCGGGTCGCC
GACTTCCACA AGCTGCTCAC CGGCCAGCCG ATCTGGCAGG CCCGGCTCAA GGACGCCGGC
TACCTCGACG TCACCGGCTG CGTGGCGATG GGCGTCACCG GGCCGGTGCT GCGCGCGGCC
GGGCTGCCGT GGGACCTGCG CAAGGTCGAG CCCTACCTGG GCTACGAGAC CTACGACTTC
GAGGTGCCGA CCGCCGACAC CTGCGACGCC TGGGGCCGCT ACCTGGTCCG CATGGCCGAG
GTGAACGAGT CGCTGAAGAT CATCGAGCAG GCGCTGGACC GGCTGGAGCC GGGGCCGGTC
ATGGTCGAGG ACAAGAAGAT CGCCTGGCCC GCGCAGCTGT CGCTGGGGCC CGACGGCATG
GGCAACTCCC TGGAGCACGT CAAGCACATC ATGGGGCAGT CGATGGAGGC CCTCATCCAC
CACTTCAAGC TGGTCACCGA GGGCTTCCGG GTGCCGGCCG GCCAGGTCTA CGTGCCCATC
GAGTCGCCCC GCGGCGAGCT GGGCTACCAC GTGGTCAGCG ACGGCGGCAC CAGACCGTGG
CGGGTGCACG TGCGCGACCC CAGCTTCGTC AACCTGCAGG CGACGGCGGC GATGAGCGAG
GGTGGCATGA TCGCCGACGT CATCGCCGCG ATCGCCTCGC TCGACCCGGT GATGGGCGGG
TGCGACCGAT GA
 
Protein sequence
MSTTYNPADP YAGSRETTEG RVYTVTGGDW DQTLGTEAYG EERLVVNMGP QHPSTHGVLR 
LVLDLEGETV TKARVVIGYL HTGIEKNTEY RNWTQGTTFV TRMDYLSPLY NEAGYCMAVE
KLLGVEAPQR AQTIRVLVME LNRIASHLVA LATFGMEMGA LTGMTNGFRE RELVLDLLEE
ITGLRMNHAY IRPGGLAQDL PPGAVEHIRE FLQVMPDRVA DFHKLLTGQP IWQARLKDAG
YLDVTGCVAM GVTGPVLRAA GLPWDLRKVE PYLGYETYDF EVPTADTCDA WGRYLVRMAE
VNESLKIIEQ ALDRLEPGPV MVEDKKIAWP AQLSLGPDGM GNSLEHVKHI MGQSMEALIH
HFKLVTEGFR VPAGQVYVPI ESPRGELGYH VVSDGGTRPW RVHVRDPSFV NLQATAAMSE
GGMIADVIAA IASLDPVMGG CDR