Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_4801 |
Symbol | |
ID | 8756502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 5012897 |
End bp | 5014528 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003411709 |
Protein GI | 284993154 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.18268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAGA TCACGCCGGG CAGTCGCGAC CAGGGCGACT CGCCCCCGCC CGGGTCCCCC GGCGCCTCGC CGAACGTCGG CCCGGTGCGC GACTCGGTCG CCGACCTGCC GGCCGGAGCG CCCACCGGGG CCGTCCGGGC GGCCGACGAG GCCCCCTCCT CGGTCGTCGG GGCGCGCCGT CCGCTCGACA TCGGCCAGGA CATCGACCTG CCGGCCGGCA CCGGGGCCGC CACCGTGGGG ACCTCGACCC TGCTCGGCCT GCCGGGTACG CGGCAGGCCG TGCAGCCGGC CTTCGACGAC CACCACCCGC CGGACCCCGC GCTCATCGGC GACTGCGTGC ACTGCGGGTT CTGCCTGCCG ACCTGCCCCA CGTACGTGCT GTGGGGCGAG GAGATGGACA GCCCCCGCGG GCGCATCTAC CTCATGAAGG AGGCGCTGGA GGGCGAGCCG CTCGACGACT CGATGGTGCG GCACTTCGAC CAGTGCCTGG GCTGCATGGC CTGCGTGACC GCCTGCCCGT CCGGGGTCCA GTACGACAAG CTCATCGAGG CCACCCGCCC GCAGATCGAG CGGCGGTACC AGCGGTCGCG GGCGGAGAAG TTCTACCGCG ACCTCATCTA CAACCTGTTC CCCTACCCCC GGCGGCTGCG CGTGCTCCGC GGGCCGCTGC GGGCCTACCA GGCCAGTCGG CTCGGCAGCC TGCTGACCCG CACCGGCCTG ATGAGCAAGC TGCCGGGCCC GTTGATGGCC ATGGAGTCGC TGGCGCCCAA GCTCGGTCCG GTGGAGCGCG TCCCCGAGCG GACGCCGGCC GTGGGGCAGC GGCGGGCGGT CGTCGGGCTG CTGGCCGGCT GCGTGCAGGG CACCTTCTTC CCCGACGTCA ACGCCGCCAC CGTGCGCGTG CTGGCCGCCG AGGGGTGCGA CGTCATCACG CCGAGGCGCC AGGGCTGCTG CGGCGCCCTG CCCGGCCACG GCGGCCGCGA GGAGCAGGCC CTCGACTTCG CCAAGCGGAC GATCGAGACC TTCGAGCAGG CCGGGGTCGA CTACGTCATC CTCAACGCTG CCGGCTGCGG CTCGAACGTC AAGGAGTACG GACACCAGCT GCGCGACGAG CCGGAGTGGG CCGAGCGCGC CGAGGCGCTC GCCGAGAAGG CCCGGGACAT CAGCGAGTTC ATCGTGGAGA TCGGCCCGGT CGCCGAGCGG CACCCGCTGC CCATGACCGT GGCCTACCAG GACGCCTGCC ACCTGGCCCA CGCGCAGGGC ATCCGCGAGG AGCCGCGGAA GGTGCTGCGC GGCATCCCGG GCATCGAGCT CAAGCAGCTC ACCGAGGCCG AGCTGTGCTG CGGCAGCGCC GGCACCTACA ACATGCTGCA GCCGGAACCG GCACGGGAGC TCGGGGAGCG CAAGGCGGCC GCCGTCCTCG CCACCGGGGC GGACCTGATG GTGACCGCCA ACCCCGGCTG CTGGATGCAG GTGGCGACCA CGCTGGCCCG GATGGGCAAG CGGATGCCGG TCGCGCACAC CGTCCAGGTG CTCGACGCCT CGATCCGCGG GGTGCCCGTC GAGGACCTGC TCGAGCGCGC GCTCACCGGT CCCGGCACCG CCCTGGCCCG ACCCTCCGCC ACGGACGGGT GA
|
Protein sequence | MTEITPGSRD QGDSPPPGSP GASPNVGPVR DSVADLPAGA PTGAVRAADE APSSVVGARR PLDIGQDIDL PAGTGAATVG TSTLLGLPGT RQAVQPAFDD HHPPDPALIG DCVHCGFCLP TCPTYVLWGE EMDSPRGRIY LMKEALEGEP LDDSMVRHFD QCLGCMACVT ACPSGVQYDK LIEATRPQIE RRYQRSRAEK FYRDLIYNLF PYPRRLRVLR GPLRAYQASR LGSLLTRTGL MSKLPGPLMA MESLAPKLGP VERVPERTPA VGQRRAVVGL LAGCVQGTFF PDVNAATVRV LAAEGCDVIT PRRQGCCGAL PGHGGREEQA LDFAKRTIET FEQAGVDYVI LNAAGCGSNV KEYGHQLRDE PEWAERAEAL AEKARDISEF IVEIGPVAER HPLPMTVAYQ DACHLAHAQG IREEPRKVLR GIPGIELKQL TEAELCCGSA GTYNMLQPEP ARELGERKAA AVLATGADLM VTANPGCWMQ VATTLARMGK RMPVAHTVQV LDASIRGVPV EDLLERALTG PGTALARPSA TDG
|
| |