Gene Information |
Locus tag | MCA1053 |
Symbol | |
ID | 3104049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1103820 |
End bp | 1105580 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637170237 |
Product | TPR domain-containing protein |
Protein accession | YP_113528 |
Protein GI | 53804652 |
COG category | [N] Cell motility; [R] General function prediction only; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF; [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
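The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag MCA1053.
length = gene_length_bp(1103820, 1105580)
print(length)                     # 1761
print(protein_length_aa(length))  # 586
```

Both results match the Gene Length (1761 bp) and Protein Length (586 aa) fields of the record.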
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.038306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCTAAAA TTCGGATACG AAAGTTCTGG CCCAGTGGTG CAGGTACTGT GCCAGGTGTG CTGTTGATTG CCCTGGCACT CGGGTGCGTG GGCCACAGGG CCAAATTTGC GGACGAAACG TTCTCGAGCG ATGAGGCCGA GATCCGTTCC GCGGTCGAGC CGCTGCCGCC CAAAGCTCAG CTCGTCTACC TGGTGCTGGC CGGGGAGTTG GCCGGCCAGC GCGGCCGCTA CGAGGTTGCG CTCGAGCATT ATCTCCAGGC CGCGCGCCTG TCGCGGGACG GGCGCCTTGC GGAACGGGCG ATGCAGATCG CCCTGTTTAT AAAAAAATAT CCCGAAGCGG TCGAAAGCGT GGCGCTCTGG CTGAAGGCCG AACCCCGCCA CGCGGGGGCC CGCCGCATGG CGACCCTGCT CTACCTGAAA GAAGGACGGC GTGACGAGGC GGTGACACAG ATGAAAGTGT TGCTGACGCT GCCGGACGCC GATCTGGAAA ATACGCTGAT CGAGTTGGTG AAGGTGCTCG GCAACGAGGT GCCCAGACAG GATGCAACGG AATTCATGGA CGCCCTGTTG CGGGCATTTC CTGCCATGGC GGATCTCCAT TTCGCAGCCG CCCTTCTCGC CGCCAACCAG GGCGAGTTCC AGCAGGCTCT GAGCGAAACC GAGGAGGCCC TGAAGCTGCA TCCGGACTGG GGCCGGGCCC GAGTACTGCA GGCACAGGTC ATGGCGCAGA TGGGTGATTC GGCGACTGCC GGGGACCTGA TACAGCGCGC GCTCAAGCGC GATCCGGACA ATGCCAGGCT GCGCCTGATC TACTCTCAGT TTCTCATCAA GTCCGGTGAC ATCGAAGGGG CGCGGCGGGA GCTGGAGCGT ATCGTAGCCA AGGAGCCCGG CAATCAGGAC GCCCGGTTCG GACTCGGATT GGCGCTCATC GATCTGGGCC GGCTCGATGC GGCCCGCCGC GAGTTCGCGG CGCTGGCCGC GTCCGAAAAA TGGCGGGTTC AAGCCTACTT TTATCTGGGG CTGATCGATG CCCGCAAGGG CAGATTGAAT GAGGCGCTGG ACTGGTTCGA CCGCGTCACG ACCGGTCCGA CCGAGTTCGA TGCCCGGGTG AACGGCATTA CCGTCCTGAT CAGCCTGGGC CGTTTGACAG AGGCGCGAAC CCGGCTCGCC GACATCCGCC GGCGTTTTCC GAACGAATCG GTTCGCCTGT ATCTCCTGGA GGCCGAGCTG CTTTCCAAGA ACCGAGACTA CGAAGATGCC TTCAATCTGT TGACCGATGC GCTCGGCGAG AATCCGGGGC AGAGCGATCT GCTCTATGCC CGGGCTCTGG TGGCGGAGAA CCTCGGTCGC TTCGACGTCC TGGAAGCGGA TTTGCGCCAG GTGCTGGAAA AGAGCCCCGA TGATCCCAAC GCGCTGAACG CCTTGGGTTA CACGCTCGTC GAGCGGGGCG AACGGTTGGA CGAGGCCAAG GGCTATCTCG ATCGGGCGAT CCGGCTCAAG CCCGATGACC CGGCGATACT CGACAGTTAC GGCTGGCTGC TGTACCGGCT GCGCAAGTAT GCCGAAGCCA TCGAATACCT CCGCCGGGCC TATGACAAGG TTCAGGATCC AGAGATCGCA TCGCATCTGG GTGAGGTTCT GATGGAGTCA GGCCGGCGTC AGGAAGCCCG GAAAATCCTG CGCGAGGCAT GGAAGAAGGC GCCCGAGCAT GAGGACATGC AGCGGATCAG GGCGCGCTAT CCGGAACTGC TGGCGCCGTG A
|
Protein sequence | MPKIRIRKFW PSGAGTVPGV LLIALALGCV GHRAKFADET FSSDEAEIRS AVEPLPPKAQ LVYLVLAGEL AGQRGRYEVA LEHYLQAARL SRDGRLAERA MQIALFIKKY PEAVESVALW LKAEPRHAGA RRMATLLYLK EGRRDEAVTQ MKVLLTLPDA DLENTLIELV KVLGNEVPRQ DATEFMDALL RAFPAMADLH FAAALLAANQ GEFQQALSET EEALKLHPDW GRARVLQAQV MAQMGDSATA GDLIQRALKR DPDNARLRLI YSQFLIKSGD IEGARRELER IVAKEPGNQD ARFGLGLALI DLGRLDAARR EFAALAASEK WRVQAYFYLG LIDARKGRLN EALDWFDRVT TGPTEFDARV NGITVLISLG RLTEARTRLA DIRRRFPNES VRLYLLEAEL LSKNRDYEDA FNLLTDALGE NPGQSDLLYA RALVAENLGR FDVLEADLRQ VLEKSPDDPN ALNALGYTLV ERGERLDEAK GYLDRAIRLK PDDPAILDSY GWLLYRLRKY AEAIEYLRRA YDKVQDPEIA SHLGEVLMES GRRQEARKIL REAWKKAPEH EDMQRIRARY PELLAP
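The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The sketch below shows one way to do that and to run basic sanity checks. The helper names are hypothetical, and the start codon set is an assumed, commonly used subset of the initiators permitted by translation table 11, not the full list. Under table 11, a GTG initiator is translated as methionine, which is consistent with the gene sequence starting with GTG while the protein sequence starts with M.

```python
# Assumption: a common subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts and ends
    with a recognized start codon and stop codon, respectively."""
    s = normalize(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in START_CODONS
        and s[-3:] in STOP_CODONS
    )


# Toy inputs only; paste the full Gene sequence field to check the real record.
print(normalize("GTGCC taaaa"))                # GTGCCTAAAA
print(gc_fraction("GTGC"))                     # 0.75
print(looks_like_complete_cds("GTG CCG TGA"))  # True
```

Applied to the full Gene sequence field, `gc_fraction` can be compared against the GC content reported in the record, and `len(normalize(...))` against the Gene Length field.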