Gene Msil_3769 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3769 
Symbol 
ID7090697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4125340 
End bp4126563 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content67% 
IMG OID643467054 
Productnitrate transporter, putative 
Protein accessionYP_002364013 
Protein GI217979866 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGGG GGGAGCCAGA AACTGCGCCG GCTGCCGCGC GCGGCGCCGG TCCGGCGCGG 
ATGCGGATCG GCTTCCTGCA GCTGACCGAC GCCGCGCCGC TGATCGTCGC GCAGGAGTTC
GGCTATTTCG CCGAAGAGGG CATCGACGCC GAGCTCGCGC CGGAGCCGTC CTGGGCGAAT
ATCGCCGACA AGCTGGTCTA TGGATTTCTT GACGCCGCGG TGATCGTGCC GCCGCTTGCC
TTCGCCATCG AACTCGGGCT TCGCGGCGTC GCGCAGCCTC TGATCGTTCC CTGCGCGATC
AGCCTCGGCG GCAATACGAT TACGCTCGCG CGGGATGTGG CCGCGGAGGT TCGCGCCCTC
GTCAAGCGAG ACGGGCGATC GACGGCCGAT GCGCTCGCCG GCTGGCTGCG CGCGCAAAGC
GCGCCGCCGC CGCCCTTCGG CGTGGTCCAC GCCTATTCGA CGCATAATCT CCTGATGCGC
TATTGGCTGG CGACGGCCGG CGTCGATATC GGGCGCGAGG CGACGCTCGC CGTCGTGCCG
CCGGCCCTCG CCGTCGACGC TTTGCGGTCG CGCCAGATTG TCGGCTTTTG CGCCGGCGCG
CCGTGGGGCG AAATCGCGGC GCGCGCGGGT GTTGGCGTCA CAGTCGCGAC CTCAAGGGAT
ATTTGGCAGA ACGCGCCGGA GAAGGCCTTC GCCGTCCGCG AATCCTGGAT CGATCAGCGT
CCGGACGCTT TGGGCGGCGC CGTCCGCGCG CTGGTTCGCG CGGCGCAATT TTGCGACGCG
CCGGAAAACG CATCCTATAC GGCCTCGCTT CTCTCACGGC AGAAATATCT GAATGTCGAT
AGTCATGCGA TCTTGCCCTC GCTGCCCGGC GGCGGCATCG CGCGGGACAA TCTGTCGAGC
TTTTATCGCA ACGCCGCGAC CTTTCCTTGG CGGTCGCATG CCTTGTGGTT TCTGCGCGAG
ATGACGCGCT GGGGGTTGAT CGAGGCCGGC CTCGACCTTC CGGCGTTGGC CGCGCGGGTT
TATCGTCCCG ATCTTTACAG ATCGGCCGTG AAGCCGCTCG ACATTCCGAC GCCGCTCGTC
GACGCGAAGA GGGAAGGCGC CCATGCCGCC CCCTGGCTGC TGGATGCTTC GCCCGCGCCG
ATTTCGATGA GCGCCGATTT GTTTTGCGAT GGGGCTATAT TCGACCCAGG CGCCTTGATT
GACGCTGCGC GGCGCAATTT TTGA
 
Protein sequence
MSRGEPETAP AAARGAGPAR MRIGFLQLTD AAPLIVAQEF GYFAEEGIDA ELAPEPSWAN 
IADKLVYGFL DAAVIVPPLA FAIELGLRGV AQPLIVPCAI SLGGNTITLA RDVAAEVRAL
VKRDGRSTAD ALAGWLRAQS APPPPFGVVH AYSTHNLLMR YWLATAGVDI GREATLAVVP
PALAVDALRS RQIVGFCAGA PWGEIAARAG VGVTVATSRD IWQNAPEKAF AVRESWIDQR
PDALGGAVRA LVRAAQFCDA PENASYTASL LSRQKYLNVD SHAILPSLPG GGIARDNLSS
FYRNAATFPW RSHALWFLRE MTRWGLIEAG LDLPALAARV YRPDLYRSAV KPLDIPTPLV
DAKREGAHAA PWLLDASPAP ISMSADLFCD GAIFDPGALI DAARRNF