Gene Mlg_1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1114 
Symbol 
ID4269838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1303607 
End bp1304704 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content67% 
IMG OID638125866 
Productporin 
Protein accessionYP_741956 
Protein GI114320273 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.600912 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCA GATTACTCGC CCGGGTCACC GGTCTAGCCC TTCTCGCCCC ACTGGCATGG 
ACGGCTAATG CCGCCGATCC ATCGGTAGAT GTCTACGCCG TGCTCCATTT GTCGCTGGAT
CACCTGGACA ACCGCGAGAG CGACAGCCAG TTCCTCTCCA CCAATCAGTC CCGCCTGGGC
ATCCGGGGCA GCACCGCCCT GTCCGCCGAC ACCCGTGCCC TCTTCCAGTA CGAAACTGAA
GTCAATGCCA CGGAAGGCGG CTCGGGGCTC TTCCGCCGCA GCCGCCACAG CTTTCTGGGC
CTGAGCGGGC CCTACGGCAC CGTCCGTGGG GGCAACCTGG ACGGCCCCCT CAAGGCCCTG
ACCGACCGCA CCCAGTTCTT CACCGCCCGA CTGGGTGACC CCGGCAACCT GATTGCCGGT
GCCGGCGTGA CCTGGGAGGA CACCATCGGC GCGGCAGACG CCCCCGGCCA CCTGCGCCGG
CACAGCAATG CGATCGACTA CACCACACCG GAATGGCAGG GCCTGAGCGC CACACTCATG
GGCACCCCGG CACAGGGTGA ATCCAGTGCC CAGACCGGTT CCTGGATGGT GCGCTGGCAA
CAACCCGCCT TTCAGCTGGC CGCCGGCTGT GTTCACAGCC GTTCCGGCAA CTTCGCCAAC
GGCGATCGTA GCCAGACGAC CCGCCAGCTG CTCGCGCAAT ACCGCGAAGG GGCCATCAAC
CTGGTCGCCA TTGTCCAGGA CCACCAACAC ATATCTGGAC GGGGGGACCG GGATGCCCGC
GCCGGCCTTC TCGGCCTGGG CTACCGGGTC GCACCCGGCC TCGAACTCCA GGGGCAGGTC
GCGCACTTCG ACGACGACCG CGGCAGTGAC CATGACTCCA CCCTTTACAC CGTGGGTGTG
GAGCATGCGA TGAATCCGCG GGCCCGGGTG TATCTGAATT ACGCCCAGGT CCGCAACGGG
GATCTGGCCG GCCGTAGCGT GGCAGGGCAG TCCCATGCCC CGCCACCCGG GCCTGACAGC
AGCCGCAGCC GCATGCTGGA GGTGGCGGAC GGCAACAACC AGTGGGGGGT GTCCGCCGGG
ATGCTTTACG TCTTTTAA
 
Protein sequence
MKRRLLARVT GLALLAPLAW TANAADPSVD VYAVLHLSLD HLDNRESDSQ FLSTNQSRLG 
IRGSTALSAD TRALFQYETE VNATEGGSGL FRRSRHSFLG LSGPYGTVRG GNLDGPLKAL
TDRTQFFTAR LGDPGNLIAG AGVTWEDTIG AADAPGHLRR HSNAIDYTTP EWQGLSATLM
GTPAQGESSA QTGSWMVRWQ QPAFQLAAGC VHSRSGNFAN GDRSQTTRQL LAQYREGAIN
LVAIVQDHQH ISGRGDRDAR AGLLGLGYRV APGLELQGQV AHFDDDRGSD HDSTLYTVGV
EHAMNPRARV YLNYAQVRNG DLAGRSVAGQ SHAPPPGPDS SRSRMLEVAD GNNQWGVSAG
MLYVF