Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3106 |
Symbol | |
ID | 7092455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 3409766 |
End bp | 3412702 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643466416 |
Product | pentapeptide repeat protein |
Protein accession | YP_002363377 |
Protein GI | 217979230 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTA CTCGTAAACC CGTTGCCCGC GCCGAGACTG GCATTTTGTT GGTCCGGGAC TCTACATTGA AGGCCGAATG GCCGAAATTC GTGGCCAACA TCGTCAAGGT GGTTGGCTCG GGCTTTGCGA TTTTCCACGG CCACCTTGAT CATATCCCTG AAGGTATCTC TAGCTTGGTC GAGGCTGCTG GTGCCATCTC TATAGATGCG CCGCCTGGAG AGCGCGGTTG GTTATTGTTC GCGTTAGGAT TTGCCTGGGC TTTTGACGAG CTTAGAGCTC GCGGTCAACT TGACGACATC GGTGTCGAAA CGGCCGTGCG CGATGCGCTC GCCGAAGCGA AGGAAAGAGT GGATCAAGGC GAGCAGATCA TCCCGACGTC ATTCCTTGAC CAACCCACTA CGCTACCCCT CTACCGACTG GTGCGAGACT CATTTGTCGA ACGGAAGGGC GCGTTCCGAG CGGCAGGGGG CGAGAAGGAC GACATATTGC GGGCGCGTTT CGACGCGGCG TTCGATCGTG CAATATTCCT GCTTTGGTCA CAGTCGCCGG AACAATACCA ACCGCTGGCA GCCGCCCTGA ATGCTCCGGG AGCAGCCGCC CATGCGTTCC GTCTGGAATG GGACGCATAT CGAAAAGGCC TGATCCACGA ATTCGAGGTT AAGCCGGTTT TTGGGCAAGA AACCCTCAAG ATTGCACTAT CGCAAATTTA TGTTCCGCTT CGGGGAGTCT GGCGTGAAGA ACACGACGAT CCACGACCTC CGTCCCGAAG CGTCAAGACG CGGGCGCTTA CACTCTCCCC TTCACGTTGG AAGGCGTCTC ATTTTGTCTG GTTGGACGAC GAACTTGACG CTTGGGCGAC TGGTGGCCAC GAGGAATACG AGTTGAAACT GTTGGGCGGT GGCCCCGGCA GCGGAAAGTC GACGATCGCG AAAGCTCTCG CGCGTAGGCT CGCGGCACGC GACGACTGCC GACCTCTTTT CATCTCCCTC GCTAACATCG ACGCAGCTAT TGATCTGCGT GAGGCGATCA ACGCCCACTT CACTAAGCGC ACGGCAAATC CATTCAGACA GCCGCCCCTT GCCCGCAATG TGGTAGAGGA TGGGCCCCCG CTTGTTCTCA TCTTCGACGG CCTCGATGAA CTGGCCCGAC CGGGGGAGGC TGCTAACGAC ATCGCTCGCG ACTTCATGGG CAAGCTTGGG CAGCTTACGT CCGAATTGCG CGGAGATAAA AAGAGCCGTC TTCGGGTCAT CGTCACGGGG CGAATGCCGT CCTTCCAGGC CGCTCAGCGC TTTGCGGGCG CGGTCAATAG GAAAAGCTAC GAGGTCGTGG GCTTTCTTCC CATTAAGGGG GCCGAACTTT TTCAGACGAG CGAGGAATTC GGCGGTAAGG GCCTAGACGT CGCTTCAGGG GTGCTAGAGT CTGTTCTGAC TGTTGCACCC CCTCAAAAAG CAGCGCCTGA GCAGCAGAGC GACATTGCCC GCATCGATCA GAGGGAAACC TGGTGGAAAG GTTATGCAGC AGCGGTCGGA CTTTCCTTAA AACCACCCCA TGCCTTATCC GCCCCCGCCC TGACAGAACT CACTCGCGAG CCCTTGCTAT GCTACCTCCT CGTGCTCTCG GGATATCTTA CCGAAGGAGC GCAATTAGCT GCCGGGAACC TGAACTTGAT CTATCGTGAA CTGATTGACG AGGTGTGGCG ACGTGGTTGG GGAGACGACC CTGATAACGG GCGGCGCCCA GGTGCCGGGA GATTTCTAAG CCAGGAGGGC TTCAATAGAC TGATGGAAAC CATTGCTCTG GCCGGATGGC AGGGAGGCGA CACGCGAGTG TGCAGCGAAG CTCGATTTCA TAATGCGGCA AGGTCTCTGC GAACTGGCAA TGCCTGGGAT GACTTCGTTC GAGACAACGG CTCCGATATT ACTAACCTCG CAATGAACTT CTACCTCAAG AACGCTGAGA TTGAGACGCG TGGTTTTGAA TTTACACATA AGAGCTTTGG CGACTATCTC GCCGGTCGAG CGCTTCTGGA AGTAGCGTTA GATCTGATCA CTCTTATTGA TCGCCGGCCC GAGACGGCGA TGAAGGAATG GTTCGATGTC ACGGCCACCG GCAACCTCAC CAACGAATTA TTGGACCACA TGCGGCGCGA ATTGCAATTA CGTGCCAGCA GTTCTGAGGG ATTAGAAGAA GTTCGCGCAC TAAAATCTGG ATTCGAACGT TTGGCTTCCG CGGTCATCCG TGATGGTTTG CCGGCGAATG GCAATGCGGA TCAGCCCTGG AGAGTGGCGG AAGCAAGGCA GCGCAACGCC GAGGTCATGG TATGGGCGGT GCTCAATGCA TGCTCATTAG CCATCGCCGC CACTGATCCT GAGCGCGCAA AAATTGAGGT TGCGTGGCCC GATAAGGCCG CTTTCGGCAA CCTTCTGCGA CGAGTTGTAG GACAGACCGA TCTCACCACA AGCTTCGGCC CGAAAATGTT CTATAAAATT TTGTTTTTTC AATTTCAATT TGCCATTGCC ACGGCAGCTG CCCCGGTCAT GAAATGCTTC GCTTATCTCG TGGCTCCTGA AGCAGACTTG ACAGGGCAAG CACTCTTTGC TGCTGATCTG CAACACGCCG ATCTGCGCCG CGCGCGCCTC GAGTTAGCCT CGTTGCGCAC ATGCAAGTTG AAAGGGACCA ACCTCTCCAA CGCAAATTTC AGCGGGTCGG TGATCGAAAC CACCGACTTC GAAGACGCTA ATCTGGAGGG CGCTCAATTC AATGATGCTA AGCTGCTAGT TTCTCCTCTG GATAAGGCCA GTGTAAAGGG AGCTACGCTT CTAAGGACGC TCATTCTCCC CGGCAAGGGC GACCGCGGCG CCGATCTCAA AAGGAGAGGA GCGAACGTTA CAGGTGTTCA AATGGCAGAT ATGCTGCGGA AACCTGATTC AGTTTGA
|
Protein sequence | MARTRKPVAR AETGILLVRD STLKAEWPKF VANIVKVVGS GFAIFHGHLD HIPEGISSLV EAAGAISIDA PPGERGWLLF ALGFAWAFDE LRARGQLDDI GVETAVRDAL AEAKERVDQG EQIIPTSFLD QPTTLPLYRL VRDSFVERKG AFRAAGGEKD DILRARFDAA FDRAIFLLWS QSPEQYQPLA AALNAPGAAA HAFRLEWDAY RKGLIHEFEV KPVFGQETLK IALSQIYVPL RGVWREEHDD PRPPSRSVKT RALTLSPSRW KASHFVWLDD ELDAWATGGH EEYELKLLGG GPGSGKSTIA KALARRLAAR DDCRPLFISL ANIDAAIDLR EAINAHFTKR TANPFRQPPL ARNVVEDGPP LVLIFDGLDE LARPGEAAND IARDFMGKLG QLTSELRGDK KSRLRVIVTG RMPSFQAAQR FAGAVNRKSY EVVGFLPIKG AELFQTSEEF GGKGLDVASG VLESVLTVAP PQKAAPEQQS DIARIDQRET WWKGYAAAVG LSLKPPHALS APALTELTRE PLLCYLLVLS GYLTEGAQLA AGNLNLIYRE LIDEVWRRGW GDDPDNGRRP GAGRFLSQEG FNRLMETIAL AGWQGGDTRV CSEARFHNAA RSLRTGNAWD DFVRDNGSDI TNLAMNFYLK NAEIETRGFE FTHKSFGDYL AGRALLEVAL DLITLIDRRP ETAMKEWFDV TATGNLTNEL LDHMRRELQL RASSSEGLEE VRALKSGFER LASAVIRDGL PANGNADQPW RVAEARQRNA EVMVWAVLNA CSLAIAATDP ERAKIEVAWP DKAAFGNLLR RVVGQTDLTT SFGPKMFYKI LFFQFQFAIA TAAAPVMKCF AYLVAPEADL TGQALFAADL QHADLRRARL ELASLRTCKL KGTNLSNANF SGSVIETTDF EDANLEGAQF NDAKLLVSPL DKASVKGATL LRTLILPGKG DRGADLKRRG ANVTGVQMAD MLRKPDSV
|
| |