Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_2747 |
Symbol | |
ID | 6128957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3053732 |
End bp | 3055585 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641642960 |
Product | TPR repeat-containing protein |
Protein accession | YP_001769619 |
Protein GI | 170740964 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.538174 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.232228 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGTC CTCTTTCGTC CCGCCTGCGC GGCGGCGCCG CCCTCCTGGC GCTGTTCCTC GCGGCCCCCT CGCCCGTCCT GGCCGCCCGC CTGACGCCCC GCGACGTCCC CGGACCGACC ACCTACGAGC CGGCCGACTC CCTCGAGGGA AATTTCCTGG CGGCCTACAT CGCGGGCGCG TCCCGGGATA CGACGGCGGC GGCCTCCTAC TATCGCGAGG CGGTGAAGGG CGACCCGCGC AACGCCGAAT TGCTGGAGCG CTCCTTCGTG GCGCTGCTGG CGGACGGGTC CCTGCAGGAG GCGTTCCGGG CGGCCGAGAA GCTTACCGCC CGCGACACGT CGAACGGCCT CGCCCAGCTC GCCCTCGGGG TTCAGAAGCT GAAGGCCAAG CAATGGGGCG CGGCGCGCCA GAACCTGCAG CGCGGCGGCC GGGGCGCCAC CGCGGACCTG ACCTCGACCC TGCTGACCGC GTGGTCCTAC GCGGGCGAGG GCCAGGGGAA GAAGGCCTTC GAGACCATCA ACAAGCTCAA GGGCGAGCGC TACATCGGGG TCTTCCGCGA CTACCACGCG GGCCTGATCG CCAGCGTGGT GGGCGACAAG GTCGAGGCCG AGCGGCGCCT GAAATCCGCC TACGACGCGG ACCGCAACAC CCTGCGGATC GTCGACGCCT ACGCGCGCTT CGAGGCCGAT TCCGGCCGGC CCGACCTCGC GGTCGCGGCC TACGAGGCCT TCGACGCGGT GCTGCCGCGC CACCCGATCG TGCGCGACGC CCTCGAGAAG CTGAAGGCCG GCAAACCCCT GCCGCCGCTG ATCGCGAGCG CCCAGGAGGG CGCCGCCGAG GTGCTGTACG GGCTCGGCAC CGCCGGCACC TCGCAGGGCG AGGAGCTGCC GGCCGTCATC TACCTGCGGC TCGCCCTCTA CCTCGCGCCC GAGCATCCCC TGGCGCTGCT CACCCTCGGC GACACGCTCG AGCGCATGAA GATGCCCGAG CGCGCCAACG AGATCTTCGC CCGCATGCCG GCGACCTCGC CGCTGAAGCT CAACACCGAC ATCCAGATCG GGCTCAACCT GGAGCAGCTC GGCAAGAACG ACGACGCGCT GGCCCATCTC GACCAGGTCG CCAAGGCCAA CCCGAAGGAC GTCGACGTGA TCTCGGCGCT GGGCAGCGTG CAGCGCCTGC GCAAGCAATA CGCGGAGGCC GCCGAGACCT ACTCGAAGGC GATCGCCCTG ATCGGCTCGG AGCCGCCCCA GAATTACTGG AACCTCTACT ACTACCGCGG CACCGCCTAC GAGCGGGCCA AGCAATGGCC GAAGGCCGAG GCCGATCTCA AGAAGGCCCT GGAGCTGGTG CCGCAGAATC AGCCGAACGG GCGCGCGCAG GTGCTGAACT ACCTCGCGTA TTCCTGGGTC GACCAGAACA TGAACATCGA CGAGGCCTTC CGCATGCTGG AGAAGGCGGT CGACCTGCAG CCGCGCGACG GGATGATCGT CGACAGTCTG GGCTGGGCGT ATTTCCGCCT CGGCCGCTGG GACGACGCCG TGCGCGAACT GGAGAAGGCC GTCGACCTCA AGCCGGGCGA CCCGACCATC AACGACCATC TCGGCGACGC GTACTGGCGC AGCGGCCGCC GGCTCGAGGC GAAGTTCCAG TGGCAGCACG CCAAGGACCT GAACCCGGAG CCCGAGGATC TCGCCAAGAT CGAGCAGAAG CTGAAGGACG GCCTGCCCGA CGCGGACAAG CCGGCCGCCA CCGCGGAGAG CCGGCCGACG CCGGACCAGC CGGCGATGCC GAAGGGCTCG CCCGCCCCGA GCGATCCGCC CGCGATGGGC TCGACCCCGA AGGGCGGCGG CTAG
|
Protein sequence | MSRPLSSRLR GGAALLALFL AAPSPVLAAR LTPRDVPGPT TYEPADSLEG NFLAAYIAGA SRDTTAAASY YREAVKGDPR NAELLERSFV ALLADGSLQE AFRAAEKLTA RDTSNGLAQL ALGVQKLKAK QWGAARQNLQ RGGRGATADL TSTLLTAWSY AGEGQGKKAF ETINKLKGER YIGVFRDYHA GLIASVVGDK VEAERRLKSA YDADRNTLRI VDAYARFEAD SGRPDLAVAA YEAFDAVLPR HPIVRDALEK LKAGKPLPPL IASAQEGAAE VLYGLGTAGT SQGEELPAVI YLRLALYLAP EHPLALLTLG DTLERMKMPE RANEIFARMP ATSPLKLNTD IQIGLNLEQL GKNDDALAHL DQVAKANPKD VDVISALGSV QRLRKQYAEA AETYSKAIAL IGSEPPQNYW NLYYYRGTAY ERAKQWPKAE ADLKKALELV PQNQPNGRAQ VLNYLAYSWV DQNMNIDEAF RMLEKAVDLQ PRDGMIVDSL GWAYFRLGRW DDAVRELEKA VDLKPGDPTI NDHLGDAYWR SGRRLEAKFQ WQHAKDLNPE PEDLAKIEQK LKDGLPDADK PAATAESRPT PDQPAMPKGS PAPSDPPAMG STPKGGG
|
| |