Gene Mlg_1113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1113 
Symbol 
ID4269837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1302555 
End bp1303514 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content66% 
IMG OID638125865 
Productphosphate binding protein 
Protein accessionYP_741955 
Protein GI114320272 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.847548 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAAGC AAAGCTGCAT CGCCGGCCTG GCCGGAACCG TTTTTGCCGT CGGCGCCGCG 
GGTGCTGCCG AGGTGGATCC GAACCTGCCC AAGTACGAGC GGGTTTCCGG TATCTCCGGC
AACCTCTCCA GTCAGGGCTC CGACACCCTG AACAACCTGA TGACCCTCTG GGCCGAGACC
TTCAACGAGT TTTACCCCAA CGTCTCCATT GAGATCGAGG GCGCCGGCTC CAGCACCGCC
CCGGTGGCGC TGGCCGAGGG CACCGCCCAC TTCGGCCCCA TGAGCCGCTC CATGCGTGAC
TCCGAAATCC AGGCCTTCGA GGACGCCCAC GGCTACGAGC CTACCCTGGC GCGGGTGGCC
ATCGACGTGC TGGCCGTCTA CGTCAACCGC GACAACCCCA TCGAGGGGCT GACCATCGAG
CAGGTGGACG GTATCTTCTC CGAGACCCAG CGCTGCGGTG GTGGGAACAT CACCCGCTGG
GGTCAGGTGG GGCTGACCGA CGAGTGGCAG AACCGCGACT TCACCCTCTA CAGCCGCAAC
GCCGTCTCCG GCACCTACGG CTACTTCCGC GAGCACGCCC TCTGCGACGG TGACTTCAAG
GACAGCATCA ACGAGCAGCC GGGCTCCGCC TCGGTGGTGC AGGGCGTGAC CGAGTCCCTG
AACGGGATCG GCTATTCGGG CATCGGCTAC CAGACCTCCG GCGTGCGGCC CATTCCGCTC
GGTCGCGACA GTGAGCTGTT CGAGCCCAGC GCGGAGAACG CGGTGACCGG CGATTACCCG
CTGGCCCGGT TCCTGTACGT CTACGTCAAC AAGCACCCTA ACGAGGAACT GCCCCCGGTG
GAGCGGGAGT TCCTGCGCAT GATCTACACC CAGCAAGGGC AGGACGTGAC CGTTCGGGAC
GGGTTCATCC CGCTGCCGGC GGCCGCTGCC GAGCGGGAGA TGGAGCGGCT GGGGCTGTAA
 
Protein sequence
MWKQSCIAGL AGTVFAVGAA GAAEVDPNLP KYERVSGISG NLSSQGSDTL NNLMTLWAET 
FNEFYPNVSI EIEGAGSSTA PVALAEGTAH FGPMSRSMRD SEIQAFEDAH GYEPTLARVA
IDVLAVYVNR DNPIEGLTIE QVDGIFSETQ RCGGGNITRW GQVGLTDEWQ NRDFTLYSRN
AVSGTYGYFR EHALCDGDFK DSINEQPGSA SVVQGVTESL NGIGYSGIGY QTSGVRPIPL
GRDSELFEPS AENAVTGDYP LARFLYVYVN KHPNEELPPV EREFLRMIYT QQGQDVTVRD
GFIPLPAAAA EREMERLGL