Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_4327 |
Symbol | |
ID | 8756021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 4548536 |
End bp | 4551439 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | LGFP repeat protein |
Protein accession | YP_003411260 |
Protein GI | 284992706 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.343026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGTCGC GCCTCCGGCT GGTCGCCACC GCCCTTGTCG GCCTCGGGCT GGCGTTCGCC TTCGTCGCCG GGGTCGCCGG CTCCTCACCG CAGCAGGCCG ACCTGCACGA GGCGGCCGAC CTGAGGCTGT TCGACCCGGG CAGCATCATC AGCGACGACC TGTTCTTCAA CGGCGCGGCG ATGAGCGGCA GCGAGGTCCA GGCGTTCCTG GACTCGAAGG GCGTCAACTG CCGGCCGGCG GCCGACGGGA GCCCCTGCCT GAAGCACTAC CGGCAGACCA CCGGCGACCG GCCGGCGGAC CAGTACTGCC GGGGGTACGC CGGCCGCCCC GACGAGAGCG CGGCCGACAT CATCACCAAG GCCGGCGCGG CCTGCGGGGT CAGCCAGCGG GTGCTGCTGG TGATGCTGCA GAAGGAGCAG TCCCTGGTCA CCGGCACCGG GTCCGTCAAG CGCTACCGCG AGGCGATGGG GTACGCCTGC CCGGACAGCG CCGGCTGCGA CCCGGCCTAC AACGGCTTCG CCAACCAGGT CTACAGCGCC GCCCGCCGGT ACCAGGTCTA CGCGAAGAAC CCGACGAGCT ACGCGCACCG CCCGGGGATC ACCGTCCAGA TCCGGTACTT CCCGGAGGGG ACCACGCCGG ACAAGTTCAA CTACCAGAGC AGGGACTGCG GCAAGGTCTC GGTCTACATC CGCAACCAGG CCACCGCGGG GCTGTACAAC TACACGCCCT ACGTGCCCAA CCAGGCCGCG CTGGACGCCG GTTACGGAAC CGGCGACCGC TGCTCCACCT ACGGCAACCG GAACTTCTTC CACTACTACG TCGACTGGTT CGGCAGCACC CAGTCGGGCG ACTCGTCGAT CGCCGAGAAG CACCGCCTGC TCATGGCCTC CGGGGTGGAC CTGGGCGCCC CCACCTCCGA GGTGGTGTGC AACCTGCCCA ACGGCGGCTG CTGGCGCAGC TGGCAGCACG GCACCATCTT CTGGTCGCAG TACACCGGGC CCCACGTGGT CCGCGGCGCG ATCCTCCAGC ACTTCCTCGC CCTCGGCGGG GTGCCCTTCC TCGGCTACCC GACCGGCGAC GACACCGTGG CGCCGGTGAA CCACGGGTAC TTCACCGACT TCCAGGGCGG GGCGATCTAC TGGTCGCAGG CCACCGGGGC GCGCGAGGTG CGCGGCAACC TGCTCGACGC GTGGCGCGCC AAGGGTGCCC AGGCCGGCGT GCTGGGTTAC CCGGTCGGCG GGGACGAGGC GGTCCCCGGT GGATTCCGCT CCCGCTTCCA GGGCGGGACG CTGTACTGGT CGCCGGCCAC CGGCGCCCGG ATGGTCCGCG GCGCGCTGCT CGCCCGGTAC GAGGCCGCCG GCGGCCCCCG GGTGATCGGC TTCCCGGTGG CCGACGAGCG GCCCACGGCG CGCAGCGGCG CGGCCGTCGA CCTCACCGGC GGGGCCGTCT ACTGGAGCTC GGCGACCGGC GCGCGCGTCG TCCGCGGCGA CATCCTGGCC ACCTACCGGC GGTGGGGCGC GGAGGCGGGC GTCCTCGGCT ACCCGACCGG CGACGACCAG GCCCACGCCT CCGGCTTCCG GACCACCTTC CAGGGCGGGC ACGTGTACTG GTCGGCGCCG ACCGGGGCGC ACGTGCTGCG CGGGGCCATC CTCGACCGCT ACCTCGCCCA CGGCGGCGCG CCGGTGCTCG GCTTCCCGAC GACGGACGAC GTCGCCGCGG CGAACGGCGG TGCCAAGGCC GACCTGCAGG GCGGGGCGGT CTACTGGTCC TCGGCGACCG GCGCACACGT CGTGCGCGGG GACATCCTGG CCAGGTGGCG GGAGTGGGGC GCGGAGTCCG GTGCGCTGGG CTACCCGACC GGTGACGACG CGGCCGCCCC GAACGGCGGG TACCTGACCA CCTTCCGCGG CGGAACGGTG TGGTGGTCCC AGCCGACCGG GGCGAAGGTG CTGCGCGGCG CGATCCTCGA GCGCTACGTG GCGCAGGGCG GGCCCCGGGT CCTCGGCTAC CCGACGACGG ACGACGTCGC CGCGGCGAAC GGCGGTGCCA AGGCCGACCT GCAGGGCGGG GCGGTCTACT GGTCCTCGGC GACCGGCGCA CACGTCGTGC GCGGGGACAT CCTGGCCAGG TGGCGGCAGT GGGGCGCGGA GACCGGTGCG CTGGGCTACC CGACCGGTGA CGACACGGCC GCCCCGAACG GCGGGTACCT GACCACCTTC CGCGGCGGCA CCGTCTACTG GTCGGCCCCC ACCGGGGCGA AGGTGGTGCG CGGCGCGATC CTGCAGCGCT ACCTGGCCGC CGGCGGTCCG CAGGCGCTGG GCTACCCGAC GACGGACGAC GTCGCCGGGG CCGGCGGTGG CGCCAAGGTC GACCTGCAGG GCGGGGCGGT CTACTGGTCC TCGGCGACCG GGGCGCGCGT CGTGCGCGGG GACATCCTGG TCAAGTGGCG GCAGTGGGGC GCGGAGACCG GTGCGCTGGG CTACCCGACC GGTGACGACA CGGCCGCCCC GGACGGCGGG TACCTGACCA CCTTCCGCGG CGGCACCGTC TACTGGTCGG CCCCCACCGG GGCGAAGGTG GTGCGCGGCG CCATCCTCGA GCACTACCTG GCCGTGGGCG GCCCGGCCGC CGTGGGCTAC CCGACGGCCG ACGACGGGCC TGCGCCCGGC GACGGTGGCG CCAAGGTGGC GCTGGAGGGT GGGGCGATCT ACTGGTCGGC GGCGACCGGC GCGCACCTGG TGCTGGGCGA CTCCGCGGTG GCCTTCGTCC AGATGGGGGA GACGACGTCC TACCTCGGGT TCCCCACCTC CGACACGGTG GAGACCGATG GTGTGGCGCG CACCGAGTTC CAGGGTGGCG TCATCGAGGT GCAGAACGGG GTGGCCAGCG CCCACCGGCG CTGA
|
Protein sequence | MRSRLRLVAT ALVGLGLAFA FVAGVAGSSP QQADLHEAAD LRLFDPGSII SDDLFFNGAA MSGSEVQAFL DSKGVNCRPA ADGSPCLKHY RQTTGDRPAD QYCRGYAGRP DESAADIITK AGAACGVSQR VLLVMLQKEQ SLVTGTGSVK RYREAMGYAC PDSAGCDPAY NGFANQVYSA ARRYQVYAKN PTSYAHRPGI TVQIRYFPEG TTPDKFNYQS RDCGKVSVYI RNQATAGLYN YTPYVPNQAA LDAGYGTGDR CSTYGNRNFF HYYVDWFGST QSGDSSIAEK HRLLMASGVD LGAPTSEVVC NLPNGGCWRS WQHGTIFWSQ YTGPHVVRGA ILQHFLALGG VPFLGYPTGD DTVAPVNHGY FTDFQGGAIY WSQATGAREV RGNLLDAWRA KGAQAGVLGY PVGGDEAVPG GFRSRFQGGT LYWSPATGAR MVRGALLARY EAAGGPRVIG FPVADERPTA RSGAAVDLTG GAVYWSSATG ARVVRGDILA TYRRWGAEAG VLGYPTGDDQ AHASGFRTTF QGGHVYWSAP TGAHVLRGAI LDRYLAHGGA PVLGFPTTDD VAAANGGAKA DLQGGAVYWS SATGAHVVRG DILARWREWG AESGALGYPT GDDAAAPNGG YLTTFRGGTV WWSQPTGAKV LRGAILERYV AQGGPRVLGY PTTDDVAAAN GGAKADLQGG AVYWSSATGA HVVRGDILAR WRQWGAETGA LGYPTGDDTA APNGGYLTTF RGGTVYWSAP TGAKVVRGAI LQRYLAAGGP QALGYPTTDD VAGAGGGAKV DLQGGAVYWS SATGARVVRG DILVKWRQWG AETGALGYPT GDDTAAPDGG YLTTFRGGTV YWSAPTGAKV VRGAILEHYL AVGGPAAVGY PTADDGPAPG DGGAKVALEG GAIYWSAATG AHLVLGDSAV AFVQMGETTS YLGFPTSDTV ETDGVARTEF QGGVIEVQNG VASAHRR
|
| |