Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1811 |
Symbol | |
ID | 4284449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | - |
Start bp | 1967104 |
End bp | 1968894 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638141301 |
Product | TPR repeat-containing protein |
Protein accession | YP_757041 |
Protein GI | 114570361 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | [TIGR02466] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.541585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.135887 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGAGA CGCTGCAGCG CGCCGCGGGC CTGCTGCAGG CCGGACGTTT TGGCGAGGCG CTCACGCTCC TCGACGGGGT CCTCAATACG CAGCCTGACC AGCCCGATGC CCTGATGCTT CGGGCCATGG CCCGAAGCCG GACCGGAGAT GACAGCGGCG CGGTCAGCGA TTTCCTGGCG GCAGGTCTTT GTCACCCCCA GCCGCATGCG GTCCACAATA ATCTCGGCAA CCATCATCAG CGAGCCGGTC GGCTGGACGC TGCGGTGGCG GCGTATCGTG AGGCTTTGCG ATTGGCACCA GGGTTTGCCG ATGCCCGCAT GAATCTGGCG ATTACCCTGT CTCAAGGTGA GGATTTCGAG GCCGCGCGGA CGGCGATCAA TGCGGTTCTC GATCAGGCTC CCAATCACGC GTTGCTGCTC AACGCACTTG GCAATGTCGA GCGGCGCGCC GGGAATCCGG ATGCAGCGAG GCGTGCCTAT GACGGCGCGA TCGCGGCTGA TCCGCAAGCG GTGCAGCCGC GCATCAATCG CGGCGCATTG CGGCGTGAAA CGGGGGAGAT CGAGGCGTCC TGTGCGGATT TGCGTGACGC TTGTGGCATG GCACCAGGTC TGGCCGAGGC CCATGCCCAG CTGGCACACA GCCTTCGCAC CAGTCTCGAT ATCACCGGGG CCGAGCAGGC CTATCGGCAG GCCCTGGCGC TGGCCCCTGA TGACCCGGGA TATCACGCTG ACCTGGCCGG CCTTTTGGCG GAGGCAGGGC AGGGCGACAA GGCTCTGACC ACCTTGCAGG GACGAATCGC GGTGAGCGGC GATCCGCGCC TCTATGAAGT CCTGGCCCGG CTGCAGATGC GCTCAGGGCG GCCGGAAGCG GCACGGGCGG CGGCGGATGC GGCACTGGCC CGTGACGCCA GCGCGGTGAA GGCGGCCATG GTCCGCAGCG AGTTGGGGCT TCGTTGTGGC GATTTGCGAG GTGCGCTGGA GGATGCCCGG TTGGCGTTTG ATGCCAGCGG AGGTGAGGAC TGGGGCGCCC GCCATGTCCT TGCTGAAGCC CTTCTTGTAA ACGCGGATTG GGAAGGCGCG ACCGATGTAC TGGTTGGCGA TCCGCCGGCC GCTCACCTGC AAAAACACCT GGCCCTGCAG TCGGTGGCCT GGCGTCAGAC GGGCAATCCT CGCTACGCCG CCCTTTGCGA TTATGACCGG TTCGCTCGCA AGATGTGGAT TGAGACGCCG CCCGGATATG AGTCTCTGGC TGCCTTCAAC GCGGCGCTGT CCGACCGGAT CGAGCGACTG CATGCCGATG GCGCAGCGCC TCTCGAGCAG ACCCTGTTTG GCGGGACCCA GTCCGCGGGG CGGTTGTGGA ACAATGATGA CCCGGTGATT CGGGCCCTCG CGACGGCGCT GGAGGGTCTG GCGGCCCGAT ATCTGGCAGA ACTGCCTGTG GATCCGTCTC ACCCTTTCCT GGCGCGAAAT ACCGGCCGTT CCAGATTGGC CGGGGCCTGG TCGGTTCGCC TGAGGTCCGG GGGCGGTCAT GTCGACCATG TCCATCCGGC GGGGTGGGTC AGTGCCAGTT ACTATGTCAG TGTTCCGGCA TCCGTGATGG CCGGCGAGCG GGCCGGCTGG CTGCGCATCG GGGCTCCGGG CCTTCCTGGG CTGGACCTCC CAGTCGAACG ATATATTCAG CCAGAACCTG GCTGCGCCAT TGTCTTCCCG TCTTATTTTT GGCACGGCGT CGAGCCATTT GAGAGTGACG AGGTTCGGGT GACCGCACCG TTCGATCTGC TCCCGGTCTG A
|
Protein sequence | MMETLQRAAG LLQAGRFGEA LTLLDGVLNT QPDQPDALML RAMARSRTGD DSGAVSDFLA AGLCHPQPHA VHNNLGNHHQ RAGRLDAAVA AYREALRLAP GFADARMNLA ITLSQGEDFE AARTAINAVL DQAPNHALLL NALGNVERRA GNPDAARRAY DGAIAADPQA VQPRINRGAL RRETGEIEAS CADLRDACGM APGLAEAHAQ LAHSLRTSLD ITGAEQAYRQ ALALAPDDPG YHADLAGLLA EAGQGDKALT TLQGRIAVSG DPRLYEVLAR LQMRSGRPEA ARAAADAALA RDASAVKAAM VRSELGLRCG DLRGALEDAR LAFDASGGED WGARHVLAEA LLVNADWEGA TDVLVGDPPA AHLQKHLALQ SVAWRQTGNP RYAALCDYDR FARKMWIETP PGYESLAAFN AALSDRIERL HADGAAPLEQ TLFGGTQSAG RLWNNDDPVI RALATALEGL AARYLAELPV DPSHPFLARN TGRSRLAGAW SVRLRSGGGH VDHVHPAGWV SASYYVSVPA SVMAGERAGW LRIGAPGLPG LDLPVERYIQ PEPGCAIVFP SYFWHGVEPF ESDEVRVTAP FDLLPV
|
| |