Gene Information |
Locus tag | Gobs_2142 |
Symbol | |
ID | 8753813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 2224054 |
End bp | 2226885 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | 40-residue YVTN family beta-propeller repeat protein |
Protein accession | YP_003409197 |
Protein GI | 284990643 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
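The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2224054, 2226885)
print(length)                     # 2832, matching "Gene Length"
print(protein_length_aa(length))  # 943, matching "Protein Length"
```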
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.624762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGC GCACGTACCT GGACTTCGAC GTGCTGGTCG AACCGGCTTC GGCGACCAGC TACCGCGCCC GGGTGCTGCA CTCCCCGGTG GGTGAGACCC GCCCGGTGCC GGTCACCGTC CCCTTCTCCG ACCTGGAGCT GGAGAACTTT CTGCTGCGCA TCGGTCGGCC ACGCCGGTAC CTGGTCCGCA GCGAGGACGC ACCGGAGGCC ACGGCGGTCC GCGACTTCGG CGGCCGGCTG TTCGACGCGG TCTTCCGCGA CCAGGTGCGC AGCGCCCTGA CCGCGAGCCT GGACCAGGCC GAGGGGCGGG ACTGCGGGCT GCGCGTGCGG CTGCGGCTGA CCGACGCACC CGAGCTGGCC GACCTGCCCT GGGAGTACCT CTACGACAAG GACGCACGCC GGTTCCTCGC GCTGTCCGAG TGGACCCCGC TGGTGCGCTA CCTCGACCTC CCGGGGCGAA TCCGTCCGCT GCCCGTGCAA CCGCCGCTGC GTGTCCTGGT GCTGGTCGCC AGTCCCTCCG ACTTCCCGCC CCTGGACGTG GACGCCGAGT GGGCCCGGCT GCACGAAGCG CTGGGCGAGC TGCAGCACGA CGGACGGGTC CGGCTCGAAC GGGCGCCCAA CGGCTCCATG GCCGAGCTGC AACGCCAGCT GCGGCGGGGC CAGCACCACG TCTTCCACTA CATCGGTCAC GGGCGGTACG ACAGCGAGCT CGGGGACGGG CAGCTGGCCA TGGAGGGAGC GACCGGCCGG GCCCAGCCCA TCAGCGGCTC CGACCTCGGC GCCCTCCTGC ACGACCACCG CACCCTGCGC CTGGCTCTGC TCAACTCGTG CGAGGGCGCC CGGGGCGGCC GCACCGACCC CTACTCGGGG ACCGCGCAGA GCCTGGTCTA CCAGGGCATC CCCGCCGTGG TCGCCATGCA GTTCGAGATC ACCGACCGGG CCGCCATCGT CTTCACCCGC GGCTTCTACG AGGCCGTCGC CGACGGCTAC CCGCTCGACG CGGCCATGGC CGAGGCCCGC AAGGCCATCC GGCTCCAGCC CAACCAGGTC GAGTGGGGCA CCCCGGTGCT GTACCTGCGC GCCCCCGACG GGCGCATCTT CGACGTCGCC CACCCGCCGG GCTCCGCCCA CAAGGCGCCG GTCGGCCCCG TCCCCCCGGA GGTCGCTCCG GAGGCCGTCC GCGGGGTCGC GCCGGAACCC GCCCCAGGCG TGCTCCCGGC CCCGCGGCCG GACCCCGGAC AGCCGCCCGG GACCGCCCAG CCACTCGTGG CGGCGCCAGC ACGGCAGGAC GTCCCGGTCA CGACCGACGA CCCCGCACGG GAGGACGGTG GACCGACGGT CGGGGACCGA CAGCGCGACC TCGCCGAGGA CGCCACCGCA CCCAGGCGGG GACCGGGGGA GCGGCGCCGA TCCGGCGATC AGGACGTCAC GGCGGCCGGG AACCCCGGGC CGACCACCGC GCCGGAGCCT CCTCGGACCC GCAGGGCCCC GGCACCGCGG GCCCGGCCGA AGGCGAGGCC GCCGGAGCGG GAGGGCGCCA CCGCGGTCAC CCCGACGGCC CCCCGGACGT CGACGAGGGA CAGCTCGGCC GACGGGACGC CCGCGTCGCC GTCGTCCCGG CCGCCGTCGT CCCGGCCGCC GTCGACACCG TCGCCGTCGA CACCGCCGCC GTCCCCGCCG CCGTCCCCGC CGTCGGGGCG GGTGCCCGTG CCGGCGCGGG GCCCCCAGCC GTCACCGGGG TACCGGTGGC TCCTCATGGC GGTCGTCCTG CTGGTCGCGG CGGCCGCTGG AGCGGTGTTC 
GTGGCGATCG ACCGGCAGCG GCTGAGCCAG CAAGGCGACC CGGCCGGGAC GATCGCGTCC GAGTCCACCG CCCCGGAGCC TGCCGCGCCC ACCAGCTCCG GCCCGCCACC GGTGTCCCCG AGCATGCCGA TCCCGTCGGC CGGCGCGATC ATCCCGGTGG GCGAGACACC CGGCTATGCG GTGGCCTCAC CGAGCGGGGC CCAGCTCTAC GTCGCCAACC GCGCGGCCCG CACCATCACG GTGGTCGACA CCGAGCTGGA CCGGGTGACC GGGACGATCC CCGTGCCGGT GGGTCCGCCG CAGTTCGTGG TGTTCTCCGC GGACGGCCGG ACGGCGTACC TCAGCCTGTA CGACGAGGGC ACCCGGGACG GGGCCTTCGG CGTCCTCGAC ACCAGGACGT GGAAGATGAT CGAGACCATC CCCCTGGACG GCAAGCCGTG GTCACCCGCG GTCAGCCGCG ACGGGGGCCG GGTCTTCGTA CCCGTCGAGG GTCCGAACAC CGTCGTGGTC ATCGACGCCG GGGCGTACGA GGTCCTGACC GAGATCCCGG TGCCGCCACT ACCGCACTCC GTCGAGTTCA CCCTCGACGG CACGCGGGCT TACGTCGCCG ACCACACCTC CAACGTCGTC GCGGTCATCG ACACCACGAC GGACAGGGTG GTCCGGGAGG TGCCGGTCGA CGCGGGCCCG CACCGCGTGG CGGTGCACCC TGCTCGACCG CTGGTCGCCA ACGTCAACTA CGACGCCGAC ACGGTCACGG TGATCGACAC GAGCACCGAC ACGGTCGTGA CCAGCATCCC GGTCGAGGCG GGGCCTCAGG ACATCACCTG GGCGCCCGAC GGCCAGTTCG CCTACGTCAC CAGCGTCGAC GCGGACACGC TCTCGGTCAT CGCGGCCGGC GACTGGAGCA CCACGGCGAG GATCCCCATC GGCGACGCCC CGACCTCGGT CGCAGTACTG CCAGACGGGT CACGCGGCTA CGTGACCAAC CTGAACGACG GCACCGTCCG GGTGCTCGAC CTCGACGGCT GA
|
Protein sequence | MTERTYLDFD VLVEPASATS YRARVLHSPV GETRPVPVTV PFSDLELENF LLRIGRPRRY LVRSEDAPEA TAVRDFGGRL FDAVFRDQVR SALTASLDQA EGRDCGLRVR LRLTDAPELA DLPWEYLYDK DARRFLALSE WTPLVRYLDL PGRIRPLPVQ PPLRVLVLVA SPSDFPPLDV DAEWARLHEA LGELQHDGRV RLERAPNGSM AELQRQLRRG QHHVFHYIGH GRYDSELGDG QLAMEGATGR AQPISGSDLG ALLHDHRTLR LALLNSCEGA RGGRTDPYSG TAQSLVYQGI PAVVAMQFEI TDRAAIVFTR GFYEAVADGY PLDAAMAEAR KAIRLQPNQV EWGTPVLYLR APDGRIFDVA HPPGSAHKAP VGPVPPEVAP EAVRGVAPEP APGVLPAPRP DPGQPPGTAQ PLVAAPARQD VPVTTDDPAR EDGGPTVGDR QRDLAEDATA PRRGPGERRR SGDQDVTAAG NPGPTTAPEP PRTRRAPAPR ARPKARPPER EGATAVTPTA PRTSTRDSSA DGTPASPSSR PPSSRPPSTP SPSTPPPSPP PSPPSGRVPV PARGPQPSPG YRWLLMAVVL LVAAAAGAVF VAIDRQRLSQ QGDPAGTIAS ESTAPEPAAP TSSGPPPVSP SMPIPSAGAI IPVGETPGYA VASPSGAQLY VANRAARTIT VVDTELDRVT GTIPVPVGPP QFVVFSADGR TAYLSLYDEG TRDGAFGVLD TRTWKMIETI PLDGKPWSPA VSRDGGRVFV PVEGPNTVVV IDAGAYEVLT EIPVPPLPHS VEFTLDGTRA YVADHTSNVV AVIDTTTDRV VREVPVDAGP HRVAVHPARP LVANVNYDAD TVTVIDTSTD TVVTSIPVEA GPQDITWAPD GQFAYVTSVD ADTLSVIAAG DWSTTARIPI GDAPTSVAVL PDGSRGYVTN LNDGTVRVLD LDG
|
| |
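The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the blocks can be joined and the GC fraction computed as follows. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the whole-gene value listed under "GC content".

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGACCGAGC GCACGTACCT GGACTTCGAC"
seq = clean_sequence(first_blocks)
print(len(seq))                       # 30
print(seq.startswith("ATG"))          # True: the CDS begins with a start codon
print(round(gc_fraction(seq) * 100))  # 60 (for these 30 bases only)
```

Applying the same two functions to the full gene sequence would give the figure to compare against the 74% reported in the record.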