Gene Gobs_4801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_4801 
Symbol 
ID8756502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp5012897 
End bp5014528 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content74% 
IMG OID 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003411709 
Protein GI284993154 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.18268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAGA TCACGCCGGG CAGTCGCGAC CAGGGCGACT CGCCCCCGCC CGGGTCCCCC 
GGCGCCTCGC CGAACGTCGG CCCGGTGCGC GACTCGGTCG CCGACCTGCC GGCCGGAGCG
CCCACCGGGG CCGTCCGGGC GGCCGACGAG GCCCCCTCCT CGGTCGTCGG GGCGCGCCGT
CCGCTCGACA TCGGCCAGGA CATCGACCTG CCGGCCGGCA CCGGGGCCGC CACCGTGGGG
ACCTCGACCC TGCTCGGCCT GCCGGGTACG CGGCAGGCCG TGCAGCCGGC CTTCGACGAC
CACCACCCGC CGGACCCCGC GCTCATCGGC GACTGCGTGC ACTGCGGGTT CTGCCTGCCG
ACCTGCCCCA CGTACGTGCT GTGGGGCGAG GAGATGGACA GCCCCCGCGG GCGCATCTAC
CTCATGAAGG AGGCGCTGGA GGGCGAGCCG CTCGACGACT CGATGGTGCG GCACTTCGAC
CAGTGCCTGG GCTGCATGGC CTGCGTGACC GCCTGCCCGT CCGGGGTCCA GTACGACAAG
CTCATCGAGG CCACCCGCCC GCAGATCGAG CGGCGGTACC AGCGGTCGCG GGCGGAGAAG
TTCTACCGCG ACCTCATCTA CAACCTGTTC CCCTACCCCC GGCGGCTGCG CGTGCTCCGC
GGGCCGCTGC GGGCCTACCA GGCCAGTCGG CTCGGCAGCC TGCTGACCCG CACCGGCCTG
ATGAGCAAGC TGCCGGGCCC GTTGATGGCC ATGGAGTCGC TGGCGCCCAA GCTCGGTCCG
GTGGAGCGCG TCCCCGAGCG GACGCCGGCC GTGGGGCAGC GGCGGGCGGT CGTCGGGCTG
CTGGCCGGCT GCGTGCAGGG CACCTTCTTC CCCGACGTCA ACGCCGCCAC CGTGCGCGTG
CTGGCCGCCG AGGGGTGCGA CGTCATCACG CCGAGGCGCC AGGGCTGCTG CGGCGCCCTG
CCCGGCCACG GCGGCCGCGA GGAGCAGGCC CTCGACTTCG CCAAGCGGAC GATCGAGACC
TTCGAGCAGG CCGGGGTCGA CTACGTCATC CTCAACGCTG CCGGCTGCGG CTCGAACGTC
AAGGAGTACG GACACCAGCT GCGCGACGAG CCGGAGTGGG CCGAGCGCGC CGAGGCGCTC
GCCGAGAAGG CCCGGGACAT CAGCGAGTTC ATCGTGGAGA TCGGCCCGGT CGCCGAGCGG
CACCCGCTGC CCATGACCGT GGCCTACCAG GACGCCTGCC ACCTGGCCCA CGCGCAGGGC
ATCCGCGAGG AGCCGCGGAA GGTGCTGCGC GGCATCCCGG GCATCGAGCT CAAGCAGCTC
ACCGAGGCCG AGCTGTGCTG CGGCAGCGCC GGCACCTACA ACATGCTGCA GCCGGAACCG
GCACGGGAGC TCGGGGAGCG CAAGGCGGCC GCCGTCCTCG CCACCGGGGC GGACCTGATG
GTGACCGCCA ACCCCGGCTG CTGGATGCAG GTGGCGACCA CGCTGGCCCG GATGGGCAAG
CGGATGCCGG TCGCGCACAC CGTCCAGGTG CTCGACGCCT CGATCCGCGG GGTGCCCGTC
GAGGACCTGC TCGAGCGCGC GCTCACCGGT CCCGGCACCG CCCTGGCCCG ACCCTCCGCC
ACGGACGGGT GA
 
Protein sequence
MTEITPGSRD QGDSPPPGSP GASPNVGPVR DSVADLPAGA PTGAVRAADE APSSVVGARR 
PLDIGQDIDL PAGTGAATVG TSTLLGLPGT RQAVQPAFDD HHPPDPALIG DCVHCGFCLP
TCPTYVLWGE EMDSPRGRIY LMKEALEGEP LDDSMVRHFD QCLGCMACVT ACPSGVQYDK
LIEATRPQIE RRYQRSRAEK FYRDLIYNLF PYPRRLRVLR GPLRAYQASR LGSLLTRTGL
MSKLPGPLMA MESLAPKLGP VERVPERTPA VGQRRAVVGL LAGCVQGTFF PDVNAATVRV
LAAEGCDVIT PRRQGCCGAL PGHGGREEQA LDFAKRTIET FEQAGVDYVI LNAAGCGSNV
KEYGHQLRDE PEWAERAEAL AEKARDISEF IVEIGPVAER HPLPMTVAYQ DACHLAHAQG
IREEPRKVLR GIPGIELKQL TEAELCCGSA GTYNMLQPEP ARELGERKAA AVLATGADLM
VTANPGCWMQ VATTLARMGK RMPVAHTVQV LDASIRGVPV EDLLERALTG PGTALARPSA
TDG