Gene Gobs_3837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_3837 
Symbol 
ID8755522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp4020229 
End bp4023321 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content81% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003410781 
Protein GI284992227 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCGGTT CCCCCGATGC CCCGGGCGCG CCCCGTCCTG ACGTCGCCCC CGGAGCGGCC 
GACCCCTTCG CCACCGCGGA GCTGCGGCGG CGGGTGCTCG CCGCCTGGGC CGACTCCCCG
GCCCGCTTCC GCGAGGACGC CAACGCCGAG GAGGACCTCG TCCGCGGCGG CTACCGCGAC
CGGCTGTTGG TGGAGCTGGC CCAGAACGCC GCCGACGCCG CCGCCCGGGC AGGGCTGGCG
GGGCGGCTGC GGCTGGAGCT GACCGGCGAC GGCGCCGGGG CGGAGGTGCT GCGGGCGGCC
AACACCGGCG CCCCGCTCGA CGCCGCCGGG GTGCAGGGCC TGGCCTCGCT GCGCGCCTCG
GCCAAGCGCG ACGAGCCGCG GTCCGGAGGC CGGGCCGGAG TCGGGGGCGG ACCGCGGGTC
GAGACGGTCG GCAGGTTCGG GGTCGGCTTC GCCGCCGTCC TCGCGGTCAC CGACGAGCCG
GCGGTGCACT CGACCGCCGG TGGGGTCGCG TTCAGCACCG GTCGCACCCG CGCCGAGGTC
GCCGCCGTCC CCGCGCTCGC CGAGGAGGTG GCCCGCCGGG ACGGCGCGGT GCCGGTGCTG
CGACTGCCCT GGCCGGCCGG GGGGACGCCA CCCGAGGGCT TCGCCACCGA GGTGGTGCTG
CCGCTGCGCG CCGGGTCGCG GGCCGCCGTG CGCACCGCCC TGGAGGAGCT GGCGCCCGAG
CTGCTGCTGG CGCTGCCCGG GCTCGCGGCG GTCGACGTCG TCCTCGACGG GACGGTCCGC
TCGCTGACGG TGCGCCGGCG CGGCGCGGAG GCCGAGCTGA CCGACGGCGA CCGGACGACG
GTCTGGCGGC TGGAGCGGCG CGACGGCGAG CTGCCCGAGG CGCTGCTCGC CGAGCGGCCG
GTCGAGGAGC GGGGACGGCG GGCCTGGACG CTGACCTGGG CGGTGCCGCT GGACGACGAC
AGGCGGCCGG CGCCCCTGCC GCTGCCCCAG CGGGTGCACG CGCCGACCCC CAGCGACGAG
CCGCTCACCC TGCCCGCCCG CCTGGTCGCG CCCTTCCCGC TGGGCCCCGA CCGGCGGCAC
GTGCTGCCCG GGACGGTCAC CGACGAGCTG GTCGCCGCGG CGGCCGGCGC CTACGCCGAC
CTCCTCTCGG GGCTGGCCGA GGGCCCGGGG AGGGACGGCG CGGTGCTGGC GCTGGTGCCG
CGCATCGGGC TGGCGGGCGC CGAGCTCGAC GCCGCGCTGT GCACCGCGGC GCTGGAGCGG
CTGCGGGAGT CCGCCTGGCT GCCGGTCGCC GGCGCGGACC GCGACCGGCA GCCGCCCGGC
CGCGCCGCGG TGCTCGACGA ACCGACCGAG GAGCGGGTGG CCGCGCTGAC CGGCGTCCTC
CCCGGGCTGC TGCCGGCGGG CTGGTCCGCC CGGACCCAGC TGCCCGCGCT GGCCGCCCTC
GGGGTGCGCC GGGTCGGCAC CGCCGAGGCC GTGGAGGCGG TCCGCGGCGT CGCCCGGCCG
CCGTCGTGGT GGGCGGGGCT GTACGCGGCC CTCGACGGCG CCGAGCGCGA GGAGCTGGCC
GCGCTGCCGG TCCCGCTCGC CGACGGCCGC ACCGCCCACG GGCCGGCGGG GGTGCTGCTA
CCCGACGAGC AACTGCCGGT GGCGCGGCTG GGTCCGCTCG GGCTGCGGGT GGCCGACCCG
GACGCGGTCG CCCCCGCCGC CGCCCGCCGG CTGCTGGAAC GGCTCGGGGC GCGCCCGGCC
ACCGCCGCCG CGGTGCTCGC CGACCCGGAG GTGCGCGCGG CGGTCGAGGC CTCGGTCGAC
GCGGCCGACG ACGGCTGGGA CACCGACCGG GACCCGGCCG CGTTCGCCGA CGCGGTCCTG
GCGCTGGTCG CCGCGGCCGG GACCGGGCCG GGGGAGCTGC CCTGGCTCTC CGAGCTGGCG
CTGCCCGACG CCGACGGTGG CTGGGCGCCG GCCGGGGAGC TGGTGCTGCC GGGGTCCCGC
TTCGCCGGCG TGCTGCTCGA CGGCGCGCTG GGCACCCTGG CCCCCGCGCT GGACGCCGAC
CCGGACGTGC TGCGGGCGGT GGGCGTGCTC GACGGCTTCG CGCTGGTGCA CGCCGAGGAC
CCCGACGACC TCGACGTCGA CGCAGCGCAG GACTGGGCCG ACGCCGTCCT CGACCGGCTG
CCCGCCGACG CGCCGGCTCC GTCGTGGCCG CCGCTGACCG CCGTCCGGGA CCTCGAGCTG
GTCGCCGACT GGACCGGCGC GCTGCCGCTG CTGGCCGCGC TGCCGGCGGA CGCGTGGGCC
GACGTCGTCC TCGACGGGGT GGCGGTGCCC GGCTACCTGC GGTGGTGGCT GGGCGGCCAC
CCGGTGCTCG GCGGGCGGCG CCCGGACCGG CTGCGGCACC CGGAGGCGGC GGAGCTGCAG
GGACTGTACG AACCGGTCGA CGCCGATCCG CGGCTGCTGG AGCTGCTGCG CCCGCCGGCG
TCGGTCGACG ACGTGCTCGC CGACGTCGAC GGCGCGCTCG ACCTGCTCGA CCGGCTCGGC
GACGACCGCC GCACCGTCTC GCCCGCCGTG CTGCGCACCG TGCACGCCCG GCTGGCCGCG
GCGCTCGACG GCGTCGCCGT GGACCCGCCG GACCGGGTGC GCGTGGCGCC GGACCGGGTG
GTCCCCGCCG AGCGGGCCGT CGTCCTGGAC GCCCCGTGGT TGCAGCCACT GGTCGACTCC
CCGCTCGTAC CGGCCGGTGG CGCCCCGGGG CCGGTGGCCG ACCTGCTGGA CCTCCCGCTG
GCCGGCGAGG TGGTGCGCGC CCGGGTGGAG AGCCGTCCGG CACGGCGGGT GCGCTGGGCC
GACGTCCCGG GGGCGCAGCA GGCCGCGGCC CGGCTCGGGC TGGCGGAGCT CCCCGGGGAG
GTCGCCGTCC ACGAGCCGCT GCGGGCCGGT GGCCGGGCGG TGCCGTGGTG GCCGGGGGAC
GGGGTCGACC ACGTGGACGG GACCCCGGGT GCGCTCGGGC GGGCGCTGGC CTGGCGGACG
GGGGCGTGGT CGCTGCGCCA GGCGCTAGCC GAGGCCTTCG CCGATCCGGA GCGCGCCGAC
GTGCTCGCCG CCGAGGACGC CGTCGGCCCG TGA
 
Protein sequence
MSGSPDAPGA PRPDVAPGAA DPFATAELRR RVLAAWADSP ARFREDANAE EDLVRGGYRD 
RLLVELAQNA ADAAARAGLA GRLRLELTGD GAGAEVLRAA NTGAPLDAAG VQGLASLRAS
AKRDEPRSGG RAGVGGGPRV ETVGRFGVGF AAVLAVTDEP AVHSTAGGVA FSTGRTRAEV
AAVPALAEEV ARRDGAVPVL RLPWPAGGTP PEGFATEVVL PLRAGSRAAV RTALEELAPE
LLLALPGLAA VDVVLDGTVR SLTVRRRGAE AELTDGDRTT VWRLERRDGE LPEALLAERP
VEERGRRAWT LTWAVPLDDD RRPAPLPLPQ RVHAPTPSDE PLTLPARLVA PFPLGPDRRH
VLPGTVTDEL VAAAAGAYAD LLSGLAEGPG RDGAVLALVP RIGLAGAELD AALCTAALER
LRESAWLPVA GADRDRQPPG RAAVLDEPTE ERVAALTGVL PGLLPAGWSA RTQLPALAAL
GVRRVGTAEA VEAVRGVARP PSWWAGLYAA LDGAEREELA ALPVPLADGR TAHGPAGVLL
PDEQLPVARL GPLGLRVADP DAVAPAAARR LLERLGARPA TAAAVLADPE VRAAVEASVD
AADDGWDTDR DPAAFADAVL ALVAAAGTGP GELPWLSELA LPDADGGWAP AGELVLPGSR
FAGVLLDGAL GTLAPALDAD PDVLRAVGVL DGFALVHAED PDDLDVDAAQ DWADAVLDRL
PADAPAPSWP PLTAVRDLEL VADWTGALPL LAALPADAWA DVVLDGVAVP GYLRWWLGGH
PVLGGRRPDR LRHPEAAELQ GLYEPVDADP RLLELLRPPA SVDDVLADVD GALDLLDRLG
DDRRTVSPAV LRTVHARLAA ALDGVAVDPP DRVRVAPDRV VPAERAVVLD APWLQPLVDS
PLVPAGGAPG PVADLLDLPL AGEVVRARVE SRPARRVRWA DVPGAQQAAA RLGLAELPGE
VAVHEPLRAG GRAVPWWPGD GVDHVDGTPG ALGRALAWRT GAWSLRQALA EAFADPERAD
VLAAEDAVGP