Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_0331 |
Symbol | |
ID | 4284991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 391123 |
End bp | 393057 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638139794 |
Product | TPR repeat-containing protein |
Protein accession | YP_755562 |
Protein GI | 114568882 |
COG category | [S] Function unknown |
COG ID | [COG5616] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00856051 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTGCGGCT CAATCCAGGG GGGATTGCGC CATCATGATC AACAGCTTGC CCAGTGCATT GGATGTCGCG AAAGCGATGG GCGGCGCATG AACGCCTTCC TCACAGAACT TCGCCGTCGC AATGTCTTCC GCGTCGCCGC CGCCTATCTG GTGGTCGGCT GGTTGCTGAT CCAGGTGACC TCGGTCGCCA AACCGGCTCT GCACCTGCCC GACTGGACCG ACACGCTCGT CTTTTTCCTG CTGGCCCTTG GCCTACCCGT CGCGCTGCTG CTGGCGTGGG CGTTCGAGAT GACGCCGGAG GGCATGCGCC CGACGCAAAC CGCCGAAAGC CCCACCGGAT TCCGCCCGCT CGGCGGAACG GATTTCGTCA TCATCGGCCT GCTGGCTGTT GTGATCGCCA TGACCGGATT CCAGCTCGTG ACCCGCCCGG ATTCCGCCCC GATCGAAGCT TCCTCCGGTG AAGCCATAAT TCAGGCGCCG GCCGATGCCT CGGTCGCCGT CCTGCCCTTC GCCGACCTCT CGCCGGCGGG TGACCAGGCC TATTTCGGCG ATGGCATGGC CGAGGAAATC CTCAATGTCC TGACCCGGGT CGACGGGCTG CATGTCGCCT CGCGCACCTC TGCCTTCCAG TTTCGCGGCG ACACCACGGG CATCCCGGAT ATCGCTCGCG CCTTGAATGT CCGCCATGTT GTGGAGGGCT CGGTACGCAA GTCCGGCGAC CAGCTTCGCA TCACCGCGCA GCTGATCGAT TCCGAAGGCG ACCGTCATCT CTGGTCCGAC ACGTTTGACC GCCCGCTCAC CGCGGAGAAC GTCTTCGCCA TCCAGGATGA TATTGCCAAC GAGATCGTCC AGGCGCTCTC GCAAGCGCTG GATCTTGACG GGTTGGAAAG CGTCGCCATC GAATCTGACA CCGGCGATCT TGATGCCTAT GACCTGTTCC TGCAGGCTCA GGCAATCTTC TTCGCCCGGT CCTCGGACAA TGTGCTGGAA GGCATCTCGC TGCTCGAACG CGCGGTCACC GTCGACCCGC GCTTTGCCCG GGCCTGGGCG CTGTTGGCCG CCTTCAATTC GGTCACACCC AGCTGGATCG ACACACGGGA CATGGACCGC GACTTCATCG CCCTTGCCCA GGATGCGGCT GACCGCGCTA CAGAACTGAA CCCCGCCCTC GCACTGCCCT ACTCGATCCG ATCCAACCTG CTCGCCGCCT CGGCTGACTG GGAGGCCGCC ATGGCGCAGG CCAATGCCGC GGTCGAACGC GAACCGGAAA CCGCGAACGT CTGGTATTTT CGCGGCGGCA TATCTTTGAA TGTCGGACAG TTCGATGCCG CCGCGGCCGA CTACCAGACC TGTCTGGATA TCGATCCGGC TTATCATATC TGTCGGCGAT ATCTCGCCTT TGCCGAGCTC TACCGCGGCA ACACCCGTCG CGCGACAGAG CTGTTTGCCG AGGGCATGGT GGCCGGGCAA GAGTCCCTTT TGTCCGTCGT AGCCCCGGCT TATTTCGCCC TGGGCAACGA CCAGGCCGGT CTCTATTTCA TCGCCTATAT GACTGCGGTT GCAGACACGC CTCATCTGAC CGAAGCACTC TATCGCTACT ACACCGACCT CGACGCGACC GATGCCGAGC TGGAGGCGAT GAGCCGTCAG TCCTACGTCG CCCTGCACGG CTCGCTGGAG GGTTATCAGC GCGTCGTATT CGACTTCTGG GTCGCCGGCC AGACGGTTTA CAGTCACGCG GAAGTGTGGA GTCCGCTCGT ACCGGAGCGC TTCCGGACCG CAACCCGTGA CAGATTTCAG ATCACGCGGC GACAGGTCCT CATCGACTTG GGCCTCCCGG CTTACTGGCG GGCCAACGGC TTTCCCCCGC AATGCCGCCC GATTGGCGCA GACGATTTCG AATGCGGCTG GATCGAGGAT CCCGACCTGC CATGA
|
Protein sequence | MCGSIQGGLR HHDQQLAQCI GCRESDGRRM NAFLTELRRR NVFRVAAAYL VVGWLLIQVT SVAKPALHLP DWTDTLVFFL LALGLPVALL LAWAFEMTPE GMRPTQTAES PTGFRPLGGT DFVIIGLLAV VIAMTGFQLV TRPDSAPIEA SSGEAIIQAP ADASVAVLPF ADLSPAGDQA YFGDGMAEEI LNVLTRVDGL HVASRTSAFQ FRGDTTGIPD IARALNVRHV VEGSVRKSGD QLRITAQLID SEGDRHLWSD TFDRPLTAEN VFAIQDDIAN EIVQALSQAL DLDGLESVAI ESDTGDLDAY DLFLQAQAIF FARSSDNVLE GISLLERAVT VDPRFARAWA LLAAFNSVTP SWIDTRDMDR DFIALAQDAA DRATELNPAL ALPYSIRSNL LAASADWEAA MAQANAAVER EPETANVWYF RGGISLNVGQ FDAAAADYQT CLDIDPAYHI CRRYLAFAEL YRGNTRRATE LFAEGMVAGQ ESLLSVVAPA YFALGNDQAG LYFIAYMTAV ADTPHLTEAL YRYYTDLDAT DAELEAMSRQ SYVALHGSLE GYQRVVFDFW VAGQTVYSHA EVWSPLVPER FRTATRDRFQ ITRRQVLIDL GLPAYWRANG FPPQCRPIGA DDFECGWIED PDLP
|
| |