Gene Mlg_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1199 
Symbol 
ID4270687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1398723 
End bp1400129 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content69% 
IMG OID638125948 
Producthypothetical protein 
Protein accessionYP_742038 
Protein GI114320355 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0215661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.10586 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCC TGCGCCTCAC TCTCAACTGG TTGCGCTTCC GTCCCCTCCC CGCGCTGCTC 
AATCTGCTGT TGATGGCGCT GGGCACCGGC ACCATTGCCC TGCTACTGCT ATACGGCCAT
CAACTGGAGC GCCAGTTCAC CCGCGATGCC CAGGGTATCG ACCTGGTGGT GGGCGCCAGT
GGCAGCCCCA TGCAGCTGAT CCTGTCCAGC GTTTATCACC TGGACATCCC CACCGGCAAC
ATCCCTGAAC GAACCGCCCG CGAGCTGGCG GACCACCGGC TGGTGAGCGA GGTCATCCCC
CTGGCCCTGG GTGACAACTA CCGCGGCCAC CGCATCGTGG GCACGGACGC GGGCTACGTG
GATCTCTACC GGGGGGAACT CGCCGAGGGG CGCCTTTGGG AGCAGGCCAT GGAGGCCACC
CTCGGATCCG CGGTGGCAGC CCGCCATGGG CTCGCCATCG GCGATGAGAT CGTCGGCGCC
CACGGCCTGG GTGGCGGCCA TGGCCATGTC CACGACTACG CCCCCTACAC CGTCGTCGGC
ATACTGGCCC CCACCGGCAC GGTCATGGAC CGGCTGGTCC TGACCTCGGT GCAGAGCGTC
TGGGACGTCC ATGACGACGA CCACGATCAC GACCACGATC ACGACCACGA TCACGACCAC
GATCACGACC ACGATCACGA CCATGAGCAC GACCATGAGC ACGACCACAG CGACCACCCT
GAGGCCGGCC ATGGCCATGA CCATGAGGCC GACCGGGCTC ACACCCACGA GCCGGCCCCG
GATCACGACC ACGGGCATCC GGCCGGGCAA GGGCATGACC ACCGGGAGCC GGGCGCCGGC
CACGACGATG TGGCCACCGC AGACCAGGAA CTGACCGCCC TGCTGGTCCG CTACCGCTCA
CCGCTGGCTG CCATGCAACT GCCCCGCGCC ATCAACGCCG AGGCCGGCCT GCAGGCCGCC
TCTCCGGCCT ACGAGAGCGC GCGCCTCATG AGTATGATGG GTGTCGGCCT GGACACCCTG
AAGGCCTTCG GCGGGGTGCT GCTGCTGGCC GCCGGCCTGG GCGTCTTCAT TGGCCTCTAC
AACGCCCTGC GGGAACGGCG GCACGACATC GCGATCATTC GCAGTCTCGG CGCCTCGCCC
CGACTGGTGA GTGGCCTGGT GCTCCTGGAA GGCCAGCTTC TGGCCCTCAC AGGCACCCTA
TTGGGCCTTG CCGGCGGCCA CCTGTCGGCC GAGTTGATCG GCCGCTGGAT CGGCCGCGAC
CGGCCGCTGG AGCTGACCGG CCTGACCTGG GTGCCGAGCG AGGGCTGGCT ACTGCTGATC
GCCGCCGGCA TCGGCCTGGT CGCCGCCCTG CTGCCGGCCT GGCAGGCCTA TCGAACCGAC
ATCGCCCTGA CGTTGTCTGA GCGGTAG
 
Protein sequence
MNALRLTLNW LRFRPLPALL NLLLMALGTG TIALLLLYGH QLERQFTRDA QGIDLVVGAS 
GSPMQLILSS VYHLDIPTGN IPERTARELA DHRLVSEVIP LALGDNYRGH RIVGTDAGYV
DLYRGELAEG RLWEQAMEAT LGSAVAARHG LAIGDEIVGA HGLGGGHGHV HDYAPYTVVG
ILAPTGTVMD RLVLTSVQSV WDVHDDDHDH DHDHDHDHDH DHDHDHDHEH DHEHDHSDHP
EAGHGHDHEA DRAHTHEPAP DHDHGHPAGQ GHDHREPGAG HDDVATADQE LTALLVRYRS
PLAAMQLPRA INAEAGLQAA SPAYESARLM SMMGVGLDTL KAFGGVLLLA AGLGVFIGLY
NALRERRHDI AIIRSLGASP RLVSGLVLLE GQLLALTGTL LGLAGGHLSA ELIGRWIGRD
RPLELTGLTW VPSEGWLLLI AAGIGLVAAL LPAWQAYRTD IALTLSER