Gene Msil_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1810 
Symbol 
ID7090927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1970748 
End bp1971908 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content65% 
IMG OID643465137 
Productprotein of unknown function DUF451 
Protein accessionYP_002362117 
Protein GI217977970 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2822] Predicted periplasmic lipoprotein involved in iron transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.742224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCGA CGCCCTCTAA AAAACCCCCG GCGGGATGGG GCATAAAGCT GCTCGTCGGC 
GCCGCTGCGC TGCTTGTCCT TGTGGCGGGA GTCGCCTTCT ATCTGGCGTC GAAAAAGGCG
CGCGCGCCGG AAGGCGCCAA GATCGTCAGC ATCACCGTCA ATGCCGATTC CTGCAACCCG
AACGAGCTCT CCGTGCCGGC TGGGCGAACC GTGTTCGAAA TCGTCAACGC CTCACAGCGC
GTCGTCGAGT GGGAGATTCT CGACGGGGTC ATGGTGCTTG AGGAGCGCGA GAACATCGCC
CCCGGGATCA CCGCGCGGCT CACGGCAAAG CTCAATCCGG GCGTGTTCGA AATCACCTGC
GGCATGCTCA ACAATCCGCG CGGCAAGCTG ACCGTCACGC CATCCGCCCA ATCCGAGGCT
GAGGCGGCGC GGCCGCCGGC GACGGCGTTC ATCGGTCCGC TCGCCGAATA TCAGGTCTAT
CTCGCACTCG AGACGGAAGA TCTGATCGAG GCGACGCAGA ATCTCAGCGA GGCGATCAAG
GCCGGCGACC TCGATCGCGC CCGATCGCTC TATGAGCCGG CGCGCAGGCC CTATCTGCAT
GTCGCCCCGG CCGCGCAGCG GTTTGGCGAT CTCGACGCGG CGATCAACGC CGAGCCGGAT
TATTTTGAAA AGCGCGAGCA AGATCCCGCC TTCTCCGGCT TCCATCGCCT CGAATATGGG
CTTTTCGGCC AAACTAGCCT CGCTGGCCTT GCTCCCGTTG CCGAAAAACT GGCGAGCGAC
GTCGCGACGC TGAAGGAGCG CATTCGCGCT TTGAAAATCG CTCCCGAGGA TATTGCCGCC
GGCGCCTCGA AACGACTCGC CAAGAGCGCG GACGCCGCAG CCTCGGGAGT CAGCGAGCGC
TACGCCCATA CCGACCAGGC CGATTTCGAG GCCGACGTCG CCGGCGCCGC CAAGAGCTTT
GACGTGCTGC GCCCGCTGAT CGCCAAAGCC TCGCCGGATC TTCTCGCTCG CGTCGACGCG
GGTTTCAAAT CCGCCAAGGC TTCGATCGCT GCTTTGAAGA CGGGCGCTGA CGCCGGCGCC
GCGCGGGCGG CTGTGGCCGC CGATCTCCGT CAACTCTCGA GCGAACTCGG AAAGCTCAAC
GCCGCCATCG GACTGGACTA G
 
Protein sequence
MDATPSKKPP AGWGIKLLVG AAALLVLVAG VAFYLASKKA RAPEGAKIVS ITVNADSCNP 
NELSVPAGRT VFEIVNASQR VVEWEILDGV MVLEERENIA PGITARLTAK LNPGVFEITC
GMLNNPRGKL TVTPSAQSEA EAARPPATAF IGPLAEYQVY LALETEDLIE ATQNLSEAIK
AGDLDRARSL YEPARRPYLH VAPAAQRFGD LDAAINAEPD YFEKREQDPA FSGFHRLEYG
LFGQTSLAGL APVAEKLASD VATLKERIRA LKIAPEDIAA GASKRLAKSA DAAASGVSER
YAHTDQADFE ADVAGAAKSF DVLRPLIAKA SPDLLARVDA GFKSAKASIA ALKTGADAGA
ARAAVAADLR QLSSELGKLN AAIGLD