Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_4368 |
Symbol | |
ID | 8756062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 4593398 |
End bp | 4596520 |
Gene Length | 3123 bp |
Protein Length | 1040 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | LGFP repeat protein |
Protein accession | YP_003411295 |
Protein GI | 284992741 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCCAGG CCGAGGTCGT CGTTCCGGCA CAGGTCGCCC CGGCGGACCC GGCACCGCAG ACGACGCCGA CCAACGAGGT CACCGTCGTC CTCGTGGCGC CGGCCGGCTC GGTCGAGGAC GGGACCCGGC TCGCCGACGT CGTCGACGTG GTCGACACCC AGGTCCGCGA CTTCTGGGCC GAGCAGACCG ACGGCGCGGT CCGGTTCGAC GCCGTTGCCG GGGCGGACGG GTGGGTGCAC ACGGCCGCCG ACTGCACCGA CCCCTTCCGG CTCTGGGAGG AGACGGCGGC GACGATCGGC TGGGAGCGCG GACCGGGCAA GCACCTGCTG CTGTACGTCA CCAGCACGCC GGAGGAGCTG GACGGCTGTG GCTACGGCCT GGCCGAGATC GGCGCCGGCC CGATGGCGGG CGGCTACGGC TACGTCCGCG ACGTCCGGCT GTCGGTCATG GCCCACGAGC TCGGGCACAA CCTCGGTCTC GGCCACTCCT CGGCGCTGCA GTGCGATCCC GAGGGCGTGT GCCGCGTGCA GCCGTACTGG GACTGGTACG ACGTCATGGG CGTCTCGTGG GAGCAGGTCG GCACGCTGAA CCCGGCGCAG GCGGACTGGC TCGGGCTCGC GCCGGCCACC TCGATCGCCG AGATCGCCGC CGGCCAGCCG GCGGCCACCT TCACCCTGTC CGCCGCCTCG CAGCGGTCCG GGACCCGGGC GGTCCTGCTG CGGGACGGTG ACCTGCGCTT CTGGCTCGAG TGGCGGCCCC CGACCGGCCG GGACGCCTGG CTGGGCACCA CCGCCAACCG TCCGAGGGTG CAGCCCGGTG TGCTGCTGCG CCGGACCTCC TACAGCCCGG ACAGCTCCTA CCTCTACGAC CCGACGCCCT CCCCGCGCTC GACCTGGAAC CGCGACCTGC AGACGGCGCT GCCGTTGAAC CGGCCGGTCC CCGTGGCCGA CGGCCTGTTC ACGGTGACCG TGCAGAGCGT GTCGGACACC GGCGTCACCG TGCGCGTCGT GCCCGGGACC AACGCCACCA CCGGCGCCAT CCAGACCGCC TACCAGCGGC CGGGCGTCGC CGACGACCTC GGGTCGCCCA CCGCGCCGCA GACCTGCGGC CTGCCCGGCG GCGGCTGCCG GCGCACCTAC GAGGGCGGCG CCATCTACTG GTCCGCGGCG ACCGGCGCGC GCGTGGTCAC CTCTCCGGTG CTGGAGGAGT ACCTGGCCCT CGGCGGGCCG GCGGCGATGG GCTACCCGCT CAAGGACGAC GCCGTCCGCT TCTGGCCGAC GACGGCCGGG CTGGTCGGCA CCTTCCCCGG TGGTGACATC GTCTGGACGC CGGAGTTCGG CGCGCACGTC GTCCGGGGCG CGATCCGGGA CCGCTGGCGT GCCGCCATCG ACGCGCTCGG CCCACCGACC GCGTCGGAGT CCGCCGTCGC CGGGGGCTAC GCGCTGCCCT TCCTCGGCGG GACGGTCTAC TGGTCGCCGT CGACCGGGGC GCGGATGGTG CGCGGGGCGA TCCTGCAGCG CTACGAGGCC GCCGGGGGGC CGCGGGCGCT GGGCTTCCCG ATCGCCGACG ACGGCGGCAC CGCCGACGGC ACCGGTGCGC TCGTCCGGCT GCAGGGCGGG GTCATCTACT GGTCGGCCCG CACCGGCGCG CACGACGTGC GCGGGGCCAT CCTGGAGCGC TGGCGGTCGC TGGGCGCGCA GACCGGGGCG CTGGGGTACC CGATCGGGGA CGACGTCGCG GTGCCCGGCG GGTGGAAGAC CGACTTCGCC GGCGGCTCGA TCTACTGGTC GCCGTCGACC GGTCCGCGGA TGGTGCGCGG GGCGATCCTG CAGCGCTACG AGGCCGCCGG GGGGCCGCGG GCGCTGGGCT TCCCGATCGC CGACGACGGC GGCACCGCCG ACGGCACCGG TGCGCTCGTC CGGCTGCAGG GCGGGGTCAT CTACTGGTCG GCCCGCACCG GCGCGCACGA CGTGCGCGGG GCCATCCTGG AGCGCTGGCG GTCGCTGGGC GCGCAGACCG GGGCGCTCGG GTACCCGATC GGGGACGACG TCGCGGTGCC CGGCGGGTGG AAGACCGACT TCGCCGGCGG CTCGATCTAC TGGTCGCCGT CCACGGGTCC GCGGATGGTG CGCGGCGCGA TCCTGCAGCG CTACGAGGCC GCCGGGGGGC CGCGGGCGCT GGGCTTCCCG ATCGCCGACG ACGGCGGCAC CGCCGACGGC ACCGGTGCGC TCGTCCGGCT GCAGGGCGGG GTCATCTACT GGTCGGCCCG CACCGGCGCG CACGACGTGC GCGGGGCCAT CCTGGAGCGC TGGCGGTCGC TGGGCGCGCA GACCGGGGCG CTGGGGTACC CGATCGGGGA CGACGTCGCG GTGCCCGGCG GGTGGAAGAC CGACTCCGCC GGCGGCTCGA TCTACTGGTC GCCGTCCACG GGTCCGCGGA TGGTGCGCGG CGCCATCCTG CAGCGGTACG AGGCCGTCGG CGGGCCGGTG AACGAGGGTT TCCCGGTCAG CGACGACGGC CCGACGGCGA GCGGTCGAGG CGCCTTCGTC GAGCTCCAGC GGGGCGCGAT CTACTGGTCG CCCTCGACCG GCGCACACCT CGTGGACGAC TACTTCCTCG AGAAGTACAG GGCAACGGGC GCCGAGACGG GCCCGCTGGG CTTCCCGACG GGGCCGATGG AGGGTCCGTT GACGGTCCTC ACACCCCACC TGCCTTTCAC AGGAGGGCGG CTCTACTGGG TCGCCGGCGG GGGGCGCACC CACATGCTGC GCGGAGCGAT CCTCGACAAG TACGTGGCTC TCGGCGGACC GCAGGGGCTC ATGTGGCCGT CCACGGAGAT CAGGTTGGGC AACCCGGTCA GTGACGACGT CCCCACCGCG ACGCGGGACG GCGTGGAGGC ACGCTTCGTG GACGGTGACA TCGCCTGGTC GGCCGCGACC GGCGCACACG CGATGCGTTC GGCCACCGCC GCTGTCTGGC GGGGGGGTCT CGGCCGGCCC GGTTCGCTGG GCTACCCGAC CACGGACTCG GTGCAGCGCG GTGACGGCAG TGGGTGGGAC ACCGCCTTCC AGCACGGCAG CCTCTTCGAG GCGCGCGACG GGACGGTGAC CAGGATCGGC TGA
|
Protein sequence | MLQAEVVVPA QVAPADPAPQ TTPTNEVTVV LVAPAGSVED GTRLADVVDV VDTQVRDFWA EQTDGAVRFD AVAGADGWVH TAADCTDPFR LWEETAATIG WERGPGKHLL LYVTSTPEEL DGCGYGLAEI GAGPMAGGYG YVRDVRLSVM AHELGHNLGL GHSSALQCDP EGVCRVQPYW DWYDVMGVSW EQVGTLNPAQ ADWLGLAPAT SIAEIAAGQP AATFTLSAAS QRSGTRAVLL RDGDLRFWLE WRPPTGRDAW LGTTANRPRV QPGVLLRRTS YSPDSSYLYD PTPSPRSTWN RDLQTALPLN RPVPVADGLF TVTVQSVSDT GVTVRVVPGT NATTGAIQTA YQRPGVADDL GSPTAPQTCG LPGGGCRRTY EGGAIYWSAA TGARVVTSPV LEEYLALGGP AAMGYPLKDD AVRFWPTTAG LVGTFPGGDI VWTPEFGAHV VRGAIRDRWR AAIDALGPPT ASESAVAGGY ALPFLGGTVY WSPSTGARMV RGAILQRYEA AGGPRALGFP IADDGGTADG TGALVRLQGG VIYWSARTGA HDVRGAILER WRSLGAQTGA LGYPIGDDVA VPGGWKTDFA GGSIYWSPST GPRMVRGAIL QRYEAAGGPR ALGFPIADDG GTADGTGALV RLQGGVIYWS ARTGAHDVRG AILERWRSLG AQTGALGYPI GDDVAVPGGW KTDFAGGSIY WSPSTGPRMV RGAILQRYEA AGGPRALGFP IADDGGTADG TGALVRLQGG VIYWSARTGA HDVRGAILER WRSLGAQTGA LGYPIGDDVA VPGGWKTDSA GGSIYWSPST GPRMVRGAIL QRYEAVGGPV NEGFPVSDDG PTASGRGAFV ELQRGAIYWS PSTGAHLVDD YFLEKYRATG AETGPLGFPT GPMEGPLTVL TPHLPFTGGR LYWVAGGGRT HMLRGAILDK YVALGGPQGL MWPSTEIRLG NPVSDDVPTA TRDGVEARFV DGDIAWSAAT GAHAMRSATA AVWRGGLGRP GSLGYPTTDS VQRGDGSGWD TAFQHGSLFE ARDGTVTRIG
|
| |