Gene Msil_2834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2834 
Symbol 
ID7092997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3112323 
End bp3113558 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content62% 
IMG OID643466145 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_002363114 
Protein GI217978967 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.960299 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCCAGT TCTGCGCCGT GTCAGGCCTC GATTATCATA CTCCCGAGAA AGCCGGCGTT 
TCGCGTCGCG CGGTGCTCGA CGGGCTGGCC GCGGGCGGCC TCGCCAGCCT GCTCGGAACC
TTCGCCAAAC CAGCTTTCGC TCAGGCTGCC GACGACGATG TCGTGCGCAT CGGCTACCTG
CCGATCACCG ACGCCGCCGC CCTGCTTGTC GCGCATGGCA AAGGCTATTT TGAAGACGAG
GGGCTGAAGG TCGAAAAGCC CACTCTCATT CGCGGCTGGG CGCCCCTTGT CGAAGCCTTC
GCCGCCGGCA AATTCAATCT CGTCCATCTT CTGAAGCCCG TCGCCCTGTC GATGCGCTAC
AACAACAACG TGCCCGTCAA AATCATGGCC TGGGCGCATA CCAACGGCTC CGGGGTCATT
GTCGACGGCG GCGCCGACAT CAAGACTTTC GCCGATCTCG GCGGCAAGCA GATCGCCGTG
CCGTTCTGGT ATTCCATGCA CAATATTGTG CTGCAATATG CGTTGCGGCA AAGCGGCCTG
ACGCCCGTCA TCAAATCCAC TCCCCCCGCG CCGAATGAGA CCAGCCTGCA GGTGATGCAG
CCGCCGGACA TGCCGCCTGC GCTCGCCGCC AAGAAGATCG ACGGCTACAT CGTCGCCGAG
CCCTTCAACG CCATGGGCGA GCTTGGCGCC GGCGGCAGGA TGCTGCGCTT CACGGGCGAT
ATCTGGAAAA ACCACCCCTG CTGCGTCGTC TGCATGCCGC AGCCTCTGAC CGAGCAAAAG
CCGGAATGGA CGCAGAAGAT CATCAACGCC ATCGTCCGCG CAGAGATTCA CGCCTCGCAA
CACAAGGAGG AGACGGCGCA GCTACTCTCG CGCGACGGCG CCGGCTATCT GCCGATGCCG
GCCCCCGTGG TGAAAAGAGC CATGACCCTC TATGAGACGA ACAAGGCCTA TCTCGATAGC
GGCGCCATCA GCCATCCGGA CTGGCGCAAC GGCCGCATCG ACTTTCAGCC ATGGCCTTAT
CCGTCGGCGA CGCGGCTGAT CGTCGAGGCG ATGAACGAAA CGCTGATCGC GGGCGATCGG
GCTTTCCTCT CGAAGCTCGA TCCCGATTTC GTCGTCAAGG ATCTTGTCAA TTACGAGTTC
GTCCGCGCCG CCCTTGAAAA ATATCCCGAC TGGAAGCTCG ATCCCAGCGT CAATGCGTCG
GATCCCTTCG CGCGGCAAGA GCTTCTCGCG CCATGA
 
Protein sequence
MCQFCAVSGL DYHTPEKAGV SRRAVLDGLA AGGLASLLGT FAKPAFAQAA DDDVVRIGYL 
PITDAAALLV AHGKGYFEDE GLKVEKPTLI RGWAPLVEAF AAGKFNLVHL LKPVALSMRY
NNNVPVKIMA WAHTNGSGVI VDGGADIKTF ADLGGKQIAV PFWYSMHNIV LQYALRQSGL
TPVIKSTPPA PNETSLQVMQ PPDMPPALAA KKIDGYIVAE PFNAMGELGA GGRMLRFTGD
IWKNHPCCVV CMPQPLTEQK PEWTQKIINA IVRAEIHASQ HKEETAQLLS RDGAGYLPMP
APVVKRAMTL YETNKAYLDS GAISHPDWRN GRIDFQPWPY PSATRLIVEA MNETLIAGDR
AFLSKLDPDF VVKDLVNYEF VRAALEKYPD WKLDPSVNAS DPFARQELLA P