Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3910 |
Symbol | |
ID | 8755595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 4096830 |
End bp | 4098398 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | protein of unknown function DUF349 |
Protein accession | YP_003410850 |
Protein GI | 284992296 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGAGCA CGGAGAACGC CGGAAACTCC TCGCAGCAGC CGCCCCAGCC CGGGACGGCC GGATCCGAGG CGGTCGCCCC CGAGGCGGGG ACGGCGGTGG CGCCCAGCCC CGGCGACGCC GCTCTCGTCG ACCCGACCGC GGGGGCCGCC ACCGAACAGG CCCCGTCGCA GGCCGCGCCG GAGACGCCCG CACCCGAGGC CTCCACTCCG GAGGCGCCCA CACCCGAGGC TCCCGGTCCC GAGACCGCCA TTCCCGAGTC CGCCACCCCG GGAACGCCCG CACCGCAGCA CGCCGTCCCG GAGCACGCCG TCCCGCAGCA CGCCGTCCCG CACCCGCCCG CCGACGCCGC CCCGGCGTCC GACCCCGCCC AGTGGGGCCG GGTGGACGAG GACGGGACGG TCTACGTGCG CACCGCCGAG GGCGAGCGCG CGGTCGGCTC GTGGCAGGCC GGCGACCCGG CCTCGGGGCT GGCCCACTAC GCCCGCCGCT ACGACGACCT GGCCACCGAG GTCACCCTGC TCGAGGCGCG GCTGCGGGCG CACACCGGCA ACCCCAGCGA GATCAAGGCC AAGGCGCAGG CGCTAGCCGA GTCCATCCCC ACCGCCACCG CCGTCGGCGA CCTCGACGGG CTGGCCGCCC GCGCCCGGGC GATGGTGGGG ACCGCCGACT CCGCGGCCGC CGAGTCCCGG GCGGAGAAGG CCGCCGCACG GGCCGCGCAG GTCGCCCGCA AGGAGGCGCT GGCCGTCGAG GCCGAGCAGA TCGCCGCCGA GTCCACCTCG TGGAAGGCCG CCGGGGACCG GCTCAAGGCC ATCGTCGAGG AGTGGAAGAC CATCCGCGGC ATCGACCGCA AGACCGACGA GGCGCTGTGG AGCCGCTTCG CCGCCGCCCG TGACGCCTTC GGTCGCCGCC GGGGCGCGCA CTTCGCCGCC CTCGACGCCC AGCGCGGCGA GGCCCGCGCG GCCAAGCAGG AGCTCATCAA CGAGGCGCAG CGGCTGTCCA CGTCGACCGA GTGGGGTCCC ACGAGCGCCG CCATGCGCTC GCTGATGGAC CGCTGGAAGG CCGTGCCGCG CACCGGCCGC GACGGCGACG ACGACCTGTG GAAGCAGTTC CGCGCCGCGC AGGACGTCTT CTTCAGCGCC CGCGCCGAGT CCGACAAGGC CCGCAACGCC GAGCAGCTGG CCAACCAGCA GGCCAAGGAG GAGATCCTCG CCGAGGCCGA GAAGCTCGAC CCGTCCAGCG ACCTGCGCGG AGCCCAGAAC GTGCTGCGCA AGCTGCAGGA GCGCTACGAC GCCATCGGCC ACGTGCCGCG CGGCGCCATG CGCCAGCTGG AGGACCGGAT GCAGGCCGTC GAGCAGCGCA TCCGCGGCGC CGTCGACACC TCCCGCCCGC GGGTGGCTCC GGAGAACCCG ATGGTCACCT CAATGCGGCA GGCGGTGACC AAGGCCGAGG AGCAGCTCGC CAAGGCCGAG GCCGCCGGCG ACGGCCGCCG GATCGAGGAG GCCCGGGCGA ACCTGGCCAC CCGCCAGGAG TGGCTGGCCG AGGCCGAGAA GAGCGCCAAC CGCCGCTGA
|
Protein sequence | MSSTENAGNS SQQPPQPGTA GSEAVAPEAG TAVAPSPGDA ALVDPTAGAA TEQAPSQAAP ETPAPEASTP EAPTPEAPGP ETAIPESATP GTPAPQHAVP EHAVPQHAVP HPPADAAPAS DPAQWGRVDE DGTVYVRTAE GERAVGSWQA GDPASGLAHY ARRYDDLATE VTLLEARLRA HTGNPSEIKA KAQALAESIP TATAVGDLDG LAARARAMVG TADSAAAESR AEKAAARAAQ VARKEALAVE AEQIAAESTS WKAAGDRLKA IVEEWKTIRG IDRKTDEALW SRFAAARDAF GRRRGAHFAA LDAQRGEARA AKQELINEAQ RLSTSTEWGP TSAAMRSLMD RWKAVPRTGR DGDDDLWKQF RAAQDVFFSA RAESDKARNA EQLANQQAKE EILAEAEKLD PSSDLRGAQN VLRKLQERYD AIGHVPRGAM RQLEDRMQAV EQRIRGAVDT SRPRVAPENP MVTSMRQAVT KAEEQLAKAE AAGDGRRIEE ARANLATRQE WLAEAEKSAN RR
|
| |