Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2834 |
Symbol | |
ID | 7092997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3112323 |
End bp | 3113558 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643466145 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_002363114 |
Protein GI | 217978967 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.960299 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCCAGT TCTGCGCCGT GTCAGGCCTC GATTATCATA CTCCCGAGAA AGCCGGCGTT TCGCGTCGCG CGGTGCTCGA CGGGCTGGCC GCGGGCGGCC TCGCCAGCCT GCTCGGAACC TTCGCCAAAC CAGCTTTCGC TCAGGCTGCC GACGACGATG TCGTGCGCAT CGGCTACCTG CCGATCACCG ACGCCGCCGC CCTGCTTGTC GCGCATGGCA AAGGCTATTT TGAAGACGAG GGGCTGAAGG TCGAAAAGCC CACTCTCATT CGCGGCTGGG CGCCCCTTGT CGAAGCCTTC GCCGCCGGCA AATTCAATCT CGTCCATCTT CTGAAGCCCG TCGCCCTGTC GATGCGCTAC AACAACAACG TGCCCGTCAA AATCATGGCC TGGGCGCATA CCAACGGCTC CGGGGTCATT GTCGACGGCG GCGCCGACAT CAAGACTTTC GCCGATCTCG GCGGCAAGCA GATCGCCGTG CCGTTCTGGT ATTCCATGCA CAATATTGTG CTGCAATATG CGTTGCGGCA AAGCGGCCTG ACGCCCGTCA TCAAATCCAC TCCCCCCGCG CCGAATGAGA CCAGCCTGCA GGTGATGCAG CCGCCGGACA TGCCGCCTGC GCTCGCCGCC AAGAAGATCG ACGGCTACAT CGTCGCCGAG CCCTTCAACG CCATGGGCGA GCTTGGCGCC GGCGGCAGGA TGCTGCGCTT CACGGGCGAT ATCTGGAAAA ACCACCCCTG CTGCGTCGTC TGCATGCCGC AGCCTCTGAC CGAGCAAAAG CCGGAATGGA CGCAGAAGAT CATCAACGCC ATCGTCCGCG CAGAGATTCA CGCCTCGCAA CACAAGGAGG AGACGGCGCA GCTACTCTCG CGCGACGGCG CCGGCTATCT GCCGATGCCG GCCCCCGTGG TGAAAAGAGC CATGACCCTC TATGAGACGA ACAAGGCCTA TCTCGATAGC GGCGCCATCA GCCATCCGGA CTGGCGCAAC GGCCGCATCG ACTTTCAGCC ATGGCCTTAT CCGTCGGCGA CGCGGCTGAT CGTCGAGGCG ATGAACGAAA CGCTGATCGC GGGCGATCGG GCTTTCCTCT CGAAGCTCGA TCCCGATTTC GTCGTCAAGG ATCTTGTCAA TTACGAGTTC GTCCGCGCCG CCCTTGAAAA ATATCCCGAC TGGAAGCTCG ATCCCAGCGT CAATGCGTCG GATCCCTTCG CGCGGCAAGA GCTTCTCGCG CCATGA
|
Protein sequence | MCQFCAVSGL DYHTPEKAGV SRRAVLDGLA AGGLASLLGT FAKPAFAQAA DDDVVRIGYL PITDAAALLV AHGKGYFEDE GLKVEKPTLI RGWAPLVEAF AAGKFNLVHL LKPVALSMRY NNNVPVKIMA WAHTNGSGVI VDGGADIKTF ADLGGKQIAV PFWYSMHNIV LQYALRQSGL TPVIKSTPPA PNETSLQVMQ PPDMPPALAA KKIDGYIVAE PFNAMGELGA GGRMLRFTGD IWKNHPCCVV CMPQPLTEQK PEWTQKIINA IVRAEIHASQ HKEETAQLLS RDGAGYLPMP APVVKRAMTL YETNKAYLDS GAISHPDWRN GRIDFQPWPY PSATRLIVEA MNETLIAGDR AFLSKLDPDF VVKDLVNYEF VRAALEKYPD WKLDPSVNAS DPFARQELLA P
|
| |