Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_4702 |
Symbol | |
ID | 8756400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 4917624 |
End bp | 4918664 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | protein of unknown function DUF201 |
Protein accession | YP_003411620 |
Protein GI | 284993066 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0701331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCCC CACTGCCCCA GCCCCGCCCG CCCGCCGCCG TCCCGCTCCC CGGCGACGCC GCCGTGCTCG TCACCGGTTC CGGTGGCCCC GCCGGCGTGG CCGTCGTCCG CCGGCTGGTC GCCCTGGGCC ACCGGGTCGT CGCCGGCGAC GCGGACAGCG CCGCTGCCGG CGGCGCGCTG GCCGACGTGG CGGTCACCCT GCCGCGCGGC GACCACCCCC GCTTCGTCGA GGCCGTGGTC GGCGTCTGCA GCACCTTCGG GGTGGACGCG CTGGTCAGCA CGGTCGCCGA GGAGCTGCCG CACCTGGCCG CCGGTGCCGG TCGGCTGGCG GCCGCCGGCG TCGCCACCTG GCTGCCGGAC GCCGCTGCCG TCGACGTGTG CTGCGACAAG GCCGCCTTCG CCCGGGCCAT GGCCGCCGCG GGCGTGCCGC ACCCGGCGAC CGCGGCGTCC CCCGGCGAGG TCGCCGGCGT CCCCGGCCCG TGGGTGGTCA AGCCGCTGGC CGGCCGCGGC AGCCGCGGCG TGCGGCTGCT CGACGAGCGG GACGACGTGG TCGCGGCGCT CCGCGCGGGC GACGGCCTGA TCGCCCAGAC CCGGCTGGCC GGCCGCGAGT TCACCGCGGA CGCCCTGGTC GACCGGGACG GCGCCGTGCG CGTGGTCGTG CCGCGCTGGC GCGAGGAGAC CAAGGCGGGC ATCTCGGTGA AGGGCCGCAC CTTCGCCTCC GACGCGGTCA CCCGGGTCGT CACCGCCGCG CTGGCCGCGG TCGAGCTGAC CGGGCCGGCC AACGTGCAGG GCTTCGTCGC CGACGACGGC ACCGCGACCG TCGTGGAGAT CAACCCGCGG TTCTCGGGCG GCCTGCCGCT GACCCTGGCC GCCGGGGCGG ACGTCGTCAG TGCGTACCTG GCCGGCGTCC GCGGGAAGCC GCTGCCGGAG CTCTCGTCCC GGCCGGGAGT CGCGATGAGC CGCTACTTCG CCGAGACGTA CAGCAGCGAG GACGGCAGCC CCGTCGTCGA CCCCTGCGCG CCCGTGATGG TGACGGCGTG A
|
Protein sequence | MTPPLPQPRP PAAVPLPGDA AVLVTGSGGP AGVAVVRRLV ALGHRVVAGD ADSAAAGGAL ADVAVTLPRG DHPRFVEAVV GVCSTFGVDA LVSTVAEELP HLAAGAGRLA AAGVATWLPD AAAVDVCCDK AAFARAMAAA GVPHPATAAS PGEVAGVPGP WVVKPLAGRG SRGVRLLDER DDVVAALRAG DGLIAQTRLA GREFTADALV DRDGAVRVVV PRWREETKAG ISVKGRTFAS DAVTRVVTAA LAAVELTGPA NVQGFVADDG TATVVEINPR FSGGLPLTLA AGADVVSAYL AGVRGKPLPE LSSRPGVAMS RYFAETYSSE DGSPVVDPCA PVMVTA
|
| |