Gene Mlg_1027 details

Gene Information

Locus tag: Mlg_1027
Symbol:
ID: 4269768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1171847
End bp: 1173487
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 68%
IMG OID: 638125779
Product: hypothetical protein
Protein accession: YP_741870
Protein GI: 114320187
COG category: [S] Function unknown
COG ID: [COG4425] Predicted membrane protein
TIGRFAM ID:
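
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1171847
END_BP = 1173487


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))  # 1641
print(protein_length(1641))           # 546
```

Both results match the Gene Length and Protein Length entries listed for this gene.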


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.847677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCGAG TCACCCGAAT CTTCGGGCAG TTCTCCACGG CGGGGCTGCT GTTGGGCACC 
CTGTTTTTCG CCTTCTCGCT GACCCCGAGC CTGTTGCCCC GGCCTTTCCT GGTGCAGGGT
GTCATCTCCG GGCTGTCGTT CGCCGCAGGG TATGCCCTGG GCTTCGCCGG CCAGTGGCTG
TGGGCCTATT TGGAATTGCC CGCCCCCCGC GCCCGCCTGG GCAGTGCCGT GAAACTGCTC
GCGACCCTGG CCTGCGTGGT GATCGCGGGC CATTTCCTGG CCCGGGCCTC GGAGTGGCAG
AACTCCGTGC GGACCCTGAT GGGGGTGGAG CCTGTGAGCG GGATCCGGGC CTACAGCATC
GCGTTGATTG CCCTAGCGGT CTTCGCCCTG TTGTTGCTGC TGGCACGCCT TTTCCGTCAC
ACCTTCCTGC TGCTCTCGGC GCGGCTGCAG CGCCATGTGC CCCGCCGGGT GTCGCACGTG
GCCGGGATCG GCGCCGCGCT GCTGCTGTTC TGGTCGGTCA TCGACGGGGT GATCTTCACC
CTGGGCCTGC GCGCCGCCGA TAACTCCTAC CAACAAGTGG ATGCCTTGAT CCAGGATGAC
TTGGATCCGC CGGAGGACCC GATGCGTACC GGCAGCGCCG CCTCCCTCAT CACCTGGGAG
GAGTTGGGCA GCCGCGGGCG CCGGTTCGTC AGCAGCGGGC CGACAGCGGA GGACCTGCGC
TGGTTTCACG GCGAACCGGT GCCGGAGCCC ATTCGGGTCT ATGTGGGGTT GAACGCGGCG
GAGACCCCGG AGGCCCGGGC CGAGCTGGCC CTGGAGGAGC TCAAGCGGGT GGGTGGTTTC
GACCGCTCGG TGTTGCTGAT CGCAACCCCC ACCGGGCGGG GTTGGGTGGA CCCGGCCGCC
CAGGAACCGG CCGAGTACCT GCACCGTGGC GATATCGCGA CGGTGACCGC GCAGTACTCC
TACCTGCCCA GCCCCTTGTC GTTGCTGGTG GAGGGTGACT ACGGGGTGGA GACCGCCCGC
GCCCTGTTTC AGGCCGTGTA CGGGCATTGG AGCCGCCTAC CGGAGGACGA GCGGCCCCGC
CTCTATCTCC ACGGTCTGAG CCTGGGGGCG CTGAATTCCG ATCGCTCCTT CGATGTCTAC
GACATCATTC AGGATCCGTT CGACGGGGCG CTCTGGAGCG GTCCCCCCTT TCGCAGCGAG
ACCTGGCGTA CCGTCACCCG CGGCCGGGAC GCCGGATCAC CGGCCTGGTT GCCCCGGTTC
CGTGACGGCT CGGTGGTCCG ATTCATGAAC CAGTACGAGG GCCTGGAGGA TCAGGGTGAT
GAGTGGGGGC CCTTCCGGAT CGCCTTCCTG CAGTATGCCA GCGACCCGGT GACGTTCTTT
GATCCCGCCG TGCTCTATCG TGAACCGGAA TGGATGCGGG AGCCGCGTGG CCCGGATGTC
TCCACCGAAC TGCGCTGGTA CCCGGTCGTC ACGATGTTGC AGCTGCTGGC CGATATTGCG
GTGGGAGGGG CACCCCGGGG GCATGGCCAT GAGATCGCCG CCGAACACTA TGTCGATGCC
TGGGTGGCGC TGACCGAGCC GGAGGGCTGG TCTGAGTCGG AGCTGGACCG GCTGCGCGGC
CGGTCCCGGC CGGAGGAGTG A
 
Protein sequence
MRRVTRIFGQ FSTAGLLLGT LFFAFSLTPS LLPRPFLVQG VISGLSFAAG YALGFAGQWL 
WAYLELPAPR ARLGSAVKLL ATLACVVIAG HFLARASEWQ NSVRTLMGVE PVSGIRAYSI
ALIALAVFAL LLLLARLFRH TFLLLSARLQ RHVPRRVSHV AGIGAALLLF WSVIDGVIFT
LGLRAADNSY QQVDALIQDD LDPPEDPMRT GSAASLITWE ELGSRGRRFV SSGPTAEDLR
WFHGEPVPEP IRVYVGLNAA ETPEARAELA LEELKRVGGF DRSVLLIATP TGRGWVDPAA
QEPAEYLHRG DIATVTAQYS YLPSPLSLLV EGDYGVETAR ALFQAVYGHW SRLPEDERPR
LYLHGLSLGA LNSDRSFDVY DIIQDPFDGA LWSGPPFRSE TWRTVTRGRD AGSPAWLPRF
RDGSVVRFMN QYEGLEDQGD EWGPFRIAFL QYASDPVTFF DPAVLYREPE WMREPRGPDV
STELRWYPVV TMLQLLADIA VGGAPRGHGH EIAAEHYVDA WVALTEPEGW SESELDRLRG
RSRPEE
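
Both sequences above are printed in blocks of 10 letters, so they need to be joined before any analysis. The following is a minimal sketch, assuming the blocks are separated only by spaces and line breaks. `clean_sequence` and `gc_fraction` are hypothetical helper names introduced here for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in 10-letter blocks by dropping all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listed above, used as a small demonstration.
first_line = "ATGCGTCGAG TCACCCGAAT CTTCGGGCAG TTCTCCACGG CGGGGCTGCT GTTGGGCACC"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base excerpt only
```

Applying `clean_sequence` to the full gene and protein listings should give strings whose lengths can be compared with the Gene Length and Protein Length fields, and `gc_fraction` on the full gene sequence can be compared with the reported GC content.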