Gene Information |
Locus tag | Gobs_4455 |
Symbol | |
ID | 8756149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 4689455 |
End bp | 4690492 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_003411376 |
Protein GI | 284992822 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
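The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the 1038 bp the record reports, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Gobs_4455.
length = gene_length_bp(4689455, 4690492)
print(length)                     # 1038
print(protein_length_aa(length))  # 345
```

The strand (-) does not change this arithmetic; it only determines which strand of the replicon the coding sequence is read from.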
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTGACC AGCCGTTGGT GCTCGGGTTC GAGACCTCGT GCGACGAGAC CGGCGTCGGC CTGGTGCGCG GGCGCACGCT GCTGTCCGAC GCGCTGGCCA CCTCGGTGGC CGAGCACGAG CGCTTCGGCG GGGTGGTGCC CGAGATCGCC TCGCGGGCGC ACCTGGAGGC GATGGTGCCG ACCGTGCACC GCGCGCTGGC CGAGGCCGGG GTGCGCACCT CCGACGTCGA CGCCGTCGCG GTGACGGCGG GGCCCGGGCT CACCGGTGCG CTGCTGGTCG GGCTGGCCGC GGCCAAGGCG TACGCGCTGG CGCTGGACAA GCCGCTGTAC GGCGTCAACC ACCTGGCCGC GCACGTGGCC GTCGACGAGC TGCAGCACGG CAGGCTGGCC GAGCCGTCGC TGGCCCTGCT GGTGTCAGGT GGGCACAGCT CGCTGCTGCT GGTGCCCGAC CTAGCCCGCG AGGTGCAGTC GCTGGGCCGC ACCATCGACG ACGCGGCGGG GGAGGCCTTC GACAAGGTGG CCCGCGTGCT CGGCCTGCCG TTCCCCGGCG GCCCGCCGAT CGACAGGGCC GCGCGCGAGG GGAACCCGGC GGCGATCGGC TTCCCGCGCG GGCTCACCGG TCCCCGCGAC GCCCCCTACG ACTTCTCGTT CTCCGGGCTG AAGACCGCCG TCGCCCGGTG GGTCGAGGCC CGGCAGCGGG CGGGGGAGCC GGTGCCGGTG GCCGACGTCG CGGCGTCGTT CCAGGAGGCG GTCGCCGACG TGCTGACCGC CAAGGCGGTG CGCGCCTGCC GCGACCACGG CGTCGACCAC CTGGTGCTCG GCGGCGGCGT GGCGGCCAAC TCCCGGCTGC GCGCCCTGGC CGAGGAGCGG TGCGCCGCGG CCGGCATCGT GCTGCGGGTG CCCAGCCCGC GGCTGTGCAC CGACAACGGC GCGATGGTCG CCGCACTCGG CTCGCGGCTG GTCGAGGCCG GGGTCGCGCC GTCGGCCCCG GACGTCGGCG CGGACAGCTC CCTGCCGATC GACGTCGTCA CCCGCTGA
|
Protein sequence | MVDQPLVLGF ETSCDETGVG LVRGRTLLSD ALATSVAEHE RFGGVVPEIA SRAHLEAMVP TVHRALAEAG VRTSDVDAVA VTAGPGLTGA LLVGLAAAKA YALALDKPLY GVNHLAAHVA VDELQHGRLA EPSLALLVSG GHSSLLLVPD LAREVQSLGR TIDDAAGEAF DKVARVLGLP FPGGPPIDRA AREGNPAAIG FPRGLTGPRD APYDFSFSGL KTAVARWVEA RQRAGEPVPV ADVAASFQEA VADVLTAKAV RACRDHGVDH LVLGGGVAAN SRLRALAEER CAAAGIVLRV PSPRLCTDNG AMVAALGSRL VEAGVAPSAP DVGADSSLPI DVVTR
|
| |
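The sequence fields can be checked against the rest of the record with a few lines of code. The following is a minimal sketch, not the database's own method: it strips the 10-base display blocks, computes GC content, and translates a coding sequence with the codon assignments of NCBI translation table 11, in which an initiator codon such as GTG is read as methionine. The helper names are hypothetical, and the input is assumed to be given in coding orientation, as the gene sequence above is (it starts with GTG and ends with TGA).

```python
from itertools import product

# Codon assignments shared by NCBI tables 1 and 11, in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for translation table 11.
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def strip_blocks(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = strip_blocks(seq)
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate_cds(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A first codon that is a table 11 initiator is emitted as 'M'.
    Trailing bases that do not form a full codon are ignored, and any
    character other than A, C, G, T raises KeyError.
    """
    bases = strip_blocks(seq)
    residues = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS_11:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two display blocks of the gene sequence above.
print(translate_cds("GTGGTTGACC AGCCGTTGGT"))  # MVDQPL
```

Passing the full gene sequence to `gc_percent` and `translate_cds` gives values that can be compared with the GC content and protein sequence listed in the record.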