Gene Mlg_2271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2271 
Symbol 
ID4268234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2575419 
End bp2576510 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content66% 
IMG OID638127028 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_743103 
Protein GI114321420 
COG category[R] General function prediction only 
COG ID[COG4174] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGGGGT ATGTCGCCCG GCGCCTGCTG CTCATGATCC CGACCCTGCT TGGGATCATG 
GTCATCAACT TCGCCATCGT GCAGTTCGCC CCCGGCGGGC CCATCGAGCG GATCGCCGCC
CAGGTGCAGG GCAGCATGGC GGATGCCACC GGTCGCTTTA CCGGGGTGGA TGCCCGCGAG
GCGGGCGACG CCACCGGGAT GGCCGACGAG GTCTCCCGTG GCGCCCGCGG CCTCCACCCG
GAGTTCATCG CCGAGTTGGA GGCGCAGTTC GGCTTCGACC GGCCCGCCCA CGAGCGCTTC
CTGCAGATGA CCTGGAACTA CCTGCGGTTC GACTTCGGCG AGTCGTTCTA CGCCGACCGG
ACCGTCATCG AGCTGATCCG CGATCGGCTG CCGGTCTCCA TCTCCCTGGG CCTGTGGACC
ACCGTGCTGG TCTACCTCAT CTCCATCCCC CTGGGCATCC GCAAGGCGGT GCGCGACGGC
AGCCGCTTCG ATCTCACCAC CTCGGCGGTG GTCTTTGTCG GCTACGCCAT CCCCAACTTC
CTGTTTGCCA TCCTGCTCAT CGTGCTCTTC GCCGGCGGCT CCTGGCTGGA TCTCTTCCCG
CTGCGGGGGC TGGTCTCCGA CAACTGGCAC GATTTGAGCT GGCCCATGCG CATCCTCGAC
TACCTGCACC ACATCACCCT GCCGGTGCTG GCCATGGTGA TCAGCGGTTT CGCCGGGCTG
ACCATGCTCA CCAAGAACAG CTTCCTGGAG GAGGTGAACA AGCAGTACGT GATGACCGCC
CGTGCCAAGG GCTGCACCGA GCGCGGCGTG CTCTATGGCC ACGTCTTCCG CAACGCCATG
CTCATCGTTA TTGCCGGCTT CCCGGCCGCC TTTATCGGCA TCCTCTTTAC CGGGGCGTTG
CTGATTGAGG TGATCTTCTC CCTGGACGGG TTGGGGCTGC TGGGCTTCGA GGCGGTGGTG
AACCGGGACT ACCCGGTGGT CTTCGGCACC CTGTTCATCT TCACCCTGCT CGGGCTGGTG
CTGAACCTCA TCGGCGACCT GATGTACGTG GCCATCGACC CGCGGATCGA CTTCGAGCGG
AGGGCGGGCT GA
 
Protein sequence
MWGYVARRLL LMIPTLLGIM VINFAIVQFA PGGPIERIAA QVQGSMADAT GRFTGVDARE 
AGDATGMADE VSRGARGLHP EFIAELEAQF GFDRPAHERF LQMTWNYLRF DFGESFYADR
TVIELIRDRL PVSISLGLWT TVLVYLISIP LGIRKAVRDG SRFDLTTSAV VFVGYAIPNF
LFAILLIVLF AGGSWLDLFP LRGLVSDNWH DLSWPMRILD YLHHITLPVL AMVISGFAGL
TMLTKNSFLE EVNKQYVMTA RAKGCTERGV LYGHVFRNAM LIVIAGFPAA FIGILFTGAL
LIEVIFSLDG LGLLGFEAVV NRDYPVVFGT LFIFTLLGLV LNLIGDLMYV AIDPRIDFER
RAG