Gene Gobs_4809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4809 
Symbol 
ID8756510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp5023027 
End bp5025315 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content69% 
IMG OID 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003411717 
Protein GI284993162 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCCCCT TGCAGGTCGT CCTCGGGACG ATCAGCGTCA TCTTCCTGAT CGTCTCGGTG 
GTGTTCGCCT ACCGGGCGGT CCGGGCCATG GTCCGCATCA TCAGGACCGG GCAGCCGGAC
AGCACCCGCA ACGGCCCGGT GGGCCCGCGG CTCAAGACGC TGCTCGTGGA GTCGCTCGGC
CACACCCGGA TGCTCAAGTG GTCGTCGATC GGCGTCGCCC ACTGGTTCGT GTTCCTGGGC
TTCTACGGCC TGTTCCTCAC CCTGGTCGAG GCCTTCGGCG AGGTCTGGAA CCCGGCCTTC
CACCTGCCGC TCGTCGGCGA GCTGGGCGTG TGGAACCTCT TCGCCGACAT CATCGGCACC
GGCACGGTCC TCGGGATCCT CTACCTGATC TACCGACGCC AGAAGGACCA CCCGCGCCGG
CAGGGCCGCA CCAGCCGTTT CGCCGGCTCC AACGAGGGCC GCGGCTACTT CGTCGAGGCC
GTCGTCCTGA TCGTCGGCGT CTGCATCCTG CTGATCCGCG GGCTCAAGGT CTCCTCGGAC
CTCACCGACG CCCCGGCCTG GTCGCATCCG GTCTCCGGCC TGCTGGCGAC CGTCCTGCCG
AACAGCCCGG ACCTGCTGAG CGTCGTCGCG TTCGTCAAGA TCGTCATATC GCTGGTCTGG
GTCGTGGTCC TCAGTCGCAC CCTGAACATG GGCGTCGCCT GGCACCGGTT CAGCGCCTTC
CCGAACATCT ACTTCAAGCG CCACGACGAC GGCAGCGTCG CCCTCGGCCC GCTGCAGGCG
ATGACCTCCG GCGGCAAGCC GATCGACTTC GAGGACCCCG ACGAGGACGC GATCTTCGGC
CGCGGCAAGA TCGAGGACTT CACCTGGAAG GGGATGCTCG ACTTCACCAC CTGCACCGAG
TGCGGGCGGT GCCAGAGCCA GTGCCCGGCG TGGAACACGG GCAAGCCGCT CAGCCCCAAG
ATGGTGGTGA TGGACCTGCG GGACCACCTG TACGCCAAGG CCCCGTACCT GCTCGGTGAG
AAGACCGCGA GCGAGACCGC ACCGGACTAC GACGTCCTCA AGCCGAGCGA CGAGCAGGTC
TGGGCCTCGG GCTACGCCCG CATCGAGGGC ACCAACGACG CGCAGGCCCA CCGCCCGCTC
GTCGGCACCC TCGAGGAGGG CGGGGTCATC GACCCCGACG TGCTGTGGAG CTGCACCAAC
TGCGGCGCCT GCGTGGAGCA GTGCCCGGTC GACATCGAGC ACATCGACCA CATCGAGGAC
ATGCGCCGTT ACCAGGTGCT CATCGAGTCC AACTTCCCCT CCGAGGCCGG CGTGATGCTG
CGCAACCTGG AGAACCGCGG CAACCCGTGG GGCGTCCAGG CCAGCACCCG CGAGGACTGG
ATGAAGGACC TGGACTTCGA GGTCCGGCAG GCCGACGGCC CACTGCCCCT CGACGTCGAG
TACCTGTTCT GGGTCGGCTG CGCCGGCGCC ATCGACGACC GGGCCAAGAA GGTCACCAAG
GCCGTCGCGG AGCTGCTGCA CACCGCCGGT GTCGAGTTCG CCGTCCTGGG CTCGGGCGAG
ACCTGCTCGG GCGACCCGGC CCGCCGCATG GGCAACGAGT TCGTCTTCCA GATGCTCGCC
CAGGAGAACG TCGAGACGCT CAACGGCGTC TTCGAGGGCC GCGTGCCGGG CATGCGCAAG
ATCGTCACCA CCTGCCCGCA CTGCTTCAAC TCGCTGGGCC GGGAGTACCC ACAGGTCGGC
GGCGACTACG AGGTGGTCCA CCACACCCAG CTGCTCAACC AGCTGGTGGC CGACGGCCGG
CTGACCCCGG TCACGCCGGT CGACCGCAAG GTCACCTACC ACGACCCGTG CTTCCTCGGC
CGGCACAACA AGGTCTACAC ACCGCCGCGG GAGATCCTCG AGTCCGTCCA GGGGCTCTCC
ACCCAGGAGA TGCACCGCTG CAAGGACCGC GGCTTCTGCT GCGGCGCCGG CGGCGCCCGG
ATGTGGATGG AGGAGAAGAT CGGCAAGCGG ATCAACATGG AGCGCACCGA GGAGGCGCTC
GACCTCGATC CGGACGTCAT CTCCACCGCG TGCCCGTTCT GCATCACGAT GCTCAGCGAC
GCGCTGACGA CCAAGAAGCA GAACGGCGAG GCCGGCGAGC ACGTCGAGGT GCTCGACGTC
AGCCAGATCC TGCTCCGTTC GCTGGCCCCG GTCGGCACGC CCGGCACCGA GCACGGCGCG
GAGGTCGGGA CCGAGACCGA CGACGTGGCC GGCACCGGCT CGGCGGCGAC GGAGCACGGC
GCGCAGTAG
 
Protein sequence
MGPLQVVLGT ISVIFLIVSV VFAYRAVRAM VRIIRTGQPD STRNGPVGPR LKTLLVESLG 
HTRMLKWSSI GVAHWFVFLG FYGLFLTLVE AFGEVWNPAF HLPLVGELGV WNLFADIIGT
GTVLGILYLI YRRQKDHPRR QGRTSRFAGS NEGRGYFVEA VVLIVGVCIL LIRGLKVSSD
LTDAPAWSHP VSGLLATVLP NSPDLLSVVA FVKIVISLVW VVVLSRTLNM GVAWHRFSAF
PNIYFKRHDD GSVALGPLQA MTSGGKPIDF EDPDEDAIFG RGKIEDFTWK GMLDFTTCTE
CGRCQSQCPA WNTGKPLSPK MVVMDLRDHL YAKAPYLLGE KTASETAPDY DVLKPSDEQV
WASGYARIEG TNDAQAHRPL VGTLEEGGVI DPDVLWSCTN CGACVEQCPV DIEHIDHIED
MRRYQVLIES NFPSEAGVML RNLENRGNPW GVQASTREDW MKDLDFEVRQ ADGPLPLDVE
YLFWVGCAGA IDDRAKKVTK AVAELLHTAG VEFAVLGSGE TCSGDPARRM GNEFVFQMLA
QENVETLNGV FEGRVPGMRK IVTTCPHCFN SLGREYPQVG GDYEVVHHTQ LLNQLVADGR
LTPVTPVDRK VTYHDPCFLG RHNKVYTPPR EILESVQGLS TQEMHRCKDR GFCCGAGGAR
MWMEEKIGKR INMERTEEAL DLDPDVISTA CPFCITMLSD ALTTKKQNGE AGEHVEVLDV
SQILLRSLAP VGTPGTEHGA EVGTETDDVA GTGSAATEHG AQ