Gene Information |
Locus tag | Gobs_4952 |
Symbol | |
ID | 8756654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 5165022 |
End bp | 5166143 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | protein of unknown function DUF182 |
Protein accession | YP_003411853 |
Protein GI | 284993298 |
COG category | |
COG ID | |
TIGRFAM ID | |
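
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5165022, 5166143)
print(length, protein_length_aa(length))
```

With the values in this record, the span 5165022 to 5166143 gives 1122 bp, and 1122 / 3 - 1 gives 373 aa, matching the Gene Length and Protein Length fields.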
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGTGACG TGCTCGAGGA CCTGATCGGC TGGTGGCAGG CGGGCGAGAC CGTCGGCATG GGCACGGTGG TGGCCACCTG GCGCTCCGCC CCGCGCCCGG CCGGCGCGTC GATGCTCGTC GGCCCCGACG GCACCGCGGT GGGCAGCGTC TCCGGCGGCT GCGTCGAGGG CGCGGTGTAC GAGGAGGCCA GGGACGTCGT GGAGTCCGGC GTCCCCACCC TCGAGCGGTA CGGGGTCAGC GACGACGACG CCTTCGCCGT GGGGCTGACC TGCGGCGGGA TCCTCGACGT CTTCGTCGAG CCGGTGTCGC GGGAGTCCTT CCCCGAGCTC GGCGAGATCG CCGAGTCCGT GGACCGGCAC GAGCCGGTCG CCGTCGTCAC CTGCGTGGCC GGCCCGGAGG ACCGGGTGGG CCGGCGGCTG GTCGTCTGGC CGGACCGCAG CTCGGGCACC CTGGGCCTGC AGCGGCTGGA CGACGCCGTC GCCGACGACG CGCGCGGCAT GCTGGCCGCC GGCCGCACCG GGATGCTGCA CGTCGGGCAC GACGGCGAGC GCCGCGGCGA CGACCTGTCG CTGTTCGTCA ACGCCTTCGC GCCGGCCGCC CGGATGGTCG TCTTCGGCGC CATCGACTTC GCCGCCGCCG TCGCCCGGGT CGGCGCCTTC CTGGGCTACC GGGTCACCGT GTGCGACGCC CGCCCGGTGT TCGCGACGCC CAAGCGCTTC CCCGACGCGC ACGAGGTGGT CGTCGAGTGG CCGCACCGCT ACCTGCAGGG CGAGGTCGAC GCCGGCCGGA TCGACGAGCG CACCGTGCTG TGCGTGCTCA CCCACGACCC CAAGTTCGAC GTCCCGCTGC TGGAGGTCGC GCTGCGGCTG CCGGTCGCCT ACGTCGGCGC GATGGGGTCC AGGCGCACGC ACGAGGAGCG GCTCGCCCGG CTGGAGGAGG CCGGGCTGTC GAAGGAGGAG CTGGCCCGGC TGTCGTCGCC GATCGGGCTG GACCTCGGCG CCCGCACCCC CGAGGAGACG GCCATCTCGA TCGCCGCGGA GGTCATCGCC GGCCGCTGGG GCGGCTCGGG GGAGCGGCTG ACCGGCACCG AGGGGCCGAT CCACCGCACC GCCGACCGCT GA
|
Protein sequence | MRDVLEDLIG WWQAGETVGM GTVVATWRSA PRPAGASMLV GPDGTAVGSV SGGCVEGAVY EEARDVVESG VPTLERYGVS DDDAFAVGLT CGGILDVFVE PVSRESFPEL GEIAESVDRH EPVAVVTCVA GPEDRVGRRL VVWPDRSSGT LGLQRLDDAV ADDARGMLAA GRTGMLHVGH DGERRGDDLS LFVNAFAPAA RMVVFGAIDF AAAVARVGAF LGYRVTVCDA RPVFATPKRF PDAHEVVVEW PHRYLQGEVD AGRIDERTVL CVLTHDPKFD VPLLEVALRL PVAYVGAMGS RRTHEERLAR LEEAGLSKEE LARLSSPIGL DLGARTPEET AISIAAEVIA GRWGGSGERL TGTEGPIHRT ADR
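
The protein sequence follows from the gene sequence under translation table 11, where the GTG initiation codon is read as methionine. The sketch below illustrates this and also shows how a GC fraction can be computed. It assumes the gene sequence is listed in coding orientation (the gene is on the minus strand of the replicon) and contains only A, C, G and T; the helper names are hypothetical, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same sense-codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by table 11; as the first codon they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is still read as Met
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate_cds("GTGCGTGACG TGCTCGAGGA CCTGATCGGC"))
```

Applied to the first 30 bases of the gene sequence, this yields MRDVLEDLIG, the first ten residues of the listed protein. Passing the full gene sequence to `gc_fraction` gives the value that the GC content field reports in rounded form.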