Gene Mlg_2061 details

Gene Information

Locus tag: Mlg_2061
Symbol:
ID: 4270447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2335350
End bp: 2336678
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 69%
IMG OID: 638126817
Product: major facilitator transporter
Protein accession: YP_742893
Protein GI: 114321210
COG category:
COG ID:
TIGRFAM ID:
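
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
length = gene_length(2335350, 2336678)
print(length)                  # 1329
print(protein_length(length))  # 442
```

Both results match the Gene Length (1329 bp) and Protein Length (442 aa) fields.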


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.310539
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCGG ATCATCACTG GTATGCCAAT CCGACACTGG ACAAGGCCTA CCGGACGCTG 
GTGGACGAGG AGGATGCGCG GGTCTGCCGC GACATCAGTG ACGAAGCCTG TCAGGTGGTG
CCGGGCAACT TTTTCCTGCA GATCCTCAGC CATTTCTTCA CCAAGCTGGG TGATGCGGTG
GCCAACCCGA AGACGGTGCT CGCCTGGCTG CTCAGCGCGC TCTCGGCCCC GGGCTTTTTC
ACCGCCCTGC TGGTGCCCAT CCGCGAGTCC GGCTCGCTGA TCCCGCAACT GTTCATCGCC
AGCTACGTGC GGCGCCTGGC GCGACGCCAA TGGGCCTTCG TCGTCGGTTG TATCCTGCAG
GCGGTGGCGG TACTGGCCAT GGCCCTGATC GCGGTGGGCC TGGAGGGCGC CGCCGCGGGC
ACCGCGTTGA TCGGCGCACT GGTACTGTTC AGCCTAGCCC GCGGGCTCTG CTCTGTGGCC
TCCAAAGACG TGCTCGGTAA GACCGTGCCC AAGACCCGGC GCGGCCAGGT CAACGGCTGG
TCCGCCTCCG CGGCCGGCCT GGTGACCATC GGCGTGGGCG CCCTGTTGCT GCTGGGAGGG
GGCAGCCCTG GCGAGACCGG CATCTATCTG TTGCTGCTCG GCGGGGCGGC CCTGCTCTGG
CTGCTGGCGG CGGCCGGCTA TGGCGCGATT CGTGAGTACC CCGGGGCCAC CTCCGGCGGC
GGCAATGCCT TCACCGAGGC CGTCCAGCGC CTGGACCGGT TGCGCACCGA CGAGCCCTTC
CGGCGCTTTG TCATCGCCCG CGCCCTGCTG CTCTGCTCGG CGCTCACCGC CCCCTTTATC
ATCATGCTGG CCCATGAGCA GACCGGGGGC GCGGCGCTGG TCCTGGGCCT GTTTGTCATC
GCAGATGGCC TGGCGAGCCT GGTCTCCGCC CCCTTCTGGG GCCGGTTCGC CGACACCTCC
AGCCGGCGGG TGATGGTGGT CGCCGGGGCC GGCGCGGGGA TGGTGGGCCT GGGACTGGTC
CTGCTGGTCC AGGCGCTGCC GCCACTGGCG GGCAGCGCCT GGCTGTACCC GCTGTTCTTC
TTCCTGCTGG CCATCGCCCA CGCCGGCGTG CGGCTGGGCC GGAAGACCTA CGTGGTGGAC
CTGGCGGGTG GGGACAAACG CACCGATTAC GTGGCGGTCA GTAATACGGT GATCGGGGTG
GTGCTCCTGC TGATGGGGGG GGTCGGATTG CTGACGGCGG TGATACCGGT CTCCGGCGTC
ATCCTCATCC TGTCAGGGAT GGGGATCGCC GGGGCTTGGC TGTCCGCCCG CCTGCCCGAG
GTCACCTGA
 
Protein sequence
MAADHHWYAN PTLDKAYRTL VDEEDARVCR DISDEACQVV PGNFFLQILS HFFTKLGDAV 
ANPKTVLAWL LSALSAPGFF TALLVPIRES GSLIPQLFIA SYVRRLARRQ WAFVVGCILQ
AVAVLAMALI AVGLEGAAAG TALIGALVLF SLARGLCSVA SKDVLGKTVP KTRRGQVNGW
SASAAGLVTI GVGALLLLGG GSPGETGIYL LLLGGAALLW LLAAAGYGAI REYPGATSGG
GNAFTEAVQR LDRLRTDEPF RRFVIARALL LCSALTAPFI IMLAHEQTGG AALVLGLFVI
ADGLASLVSA PFWGRFADTS SRRVMVVAGA GAGMVGLGLV LLVQALPPLA GSAWLYPLFF
FLLAIAHAGV RLGRKTYVVD LAGGDKRTDY VAVSNTVIGV VLLLMGGVGL LTAVIPVSGV
ILILSGMGIA GAWLSARLPE VT
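
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that generated this record: it builds the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the conventional T, C, A, G order; table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used for 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final block of the gene sequence above.
print(translate("ATGGCGGCGG ATCATCACTG GTATGCCAAT"))  # MAADHHWYAN
print(translate("GTCACCTGA"))                         # VT
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison against the listed protein sequence and the GC content field.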