Gene Mlg_1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1222 
Symbol 
ID4269753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1424889 
End bp1426181 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content65% 
IMG OID638125972 
Producthypothetical protein 
Protein accessionYP_742061 
Protein GI114320378 
COG category[S] Function unknown 
COG ID[COG2718] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.134791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCAATA TCGTAGACCG ACGGCTCAAC CCCAAGGACA AGAGCCTGGC CAACCGCCAG 
CGCTTTCTGC GTCGGGCCAA GCGACAGGTG CTGGACGCGG TGCGGGACGC ATCCGCCAAA
CGCGGCGTGA AGGACATGCG CGGTGAGGGC GAGCAGATCA GCATTCCCGC GGATGGTCTG
GGGGAGCCGT CGTTCCGCAA GGGGGGCGAT ACCGGGGTGC GCGAGCATGT GCTGCCCGGT
AACAAGGAGT ACCGGGTGGG CGATGTCATC CCGCGGCCCG AGGGCGGCGG CGGGGGAGGC
GGTCGCTCCG AGGGCAGCCC CGACGGGGAG GGCCAGGACG AGTTCCAGTT CGTGGTCTCG
CGCGACGAGT TCCTGGACCT GTTCTTCGAG AACCTGGCCC TGCCGGACCT GGTTAAAAAG
GACATGAAGA AGACGGAGCG GTTCGCCAAC CAACGGGCCG GGTACAGCGT CAGTGGCTCG
CCGTCCAATC TCAACCTGAC GCGCACCATG CGCAACAGCC TGTCGCGCCG GATCGCCCTC
CGGCGTCCCA AGCGCGAACA CCTCCGCGAG CTAGAGCGAG AGATCGATGG GCTCGAACGC
AGTGGAAAGG AGCCGCAGCG GCTGCGCGCG TTGATCGAGG AACTGGAGCG GGAGACTCAC
CGGGCGCGCC AGATCCCGTG GATCGACCCT ATCGACATCC GGTACAACCG CTTCGAGCCG
GTGCCCCGAC CGGTCTCCCA GGCGGTTATG TTCTGCCTGA TGGATGTCTC CGGGTCCATG
ACCGAGGACA TGAAGGACCT GGCCAAGCGC TTCTTCATGC TGCTCTACCT CTTCCTGGAG
CGGCGGTACC GGCACGTGGA CGTGGTCTTC ATCCGTCACA CCCATATCGC CCAGGAGGTC
GATGAGGAGA CCTTTTTCTA CAGTCGCGAA ACCGGCGGCA CACTGGTCTC TCCGGCCCTG
GATGAGATGT ACCGCGTGCA GCGGGACCGC TACCCGGAAG AGAGCTGGAA TATTTACGCC
GCCCAGGCCT CCGACGGAGA CAACACGCCA GCGGACAACC CGCGGGTTAT CAAGATGATG
CGGGAGACCA TCCTGCCACT GACCCAGTAT TTCGCCTACA TCGAGGTGGG CGGACAAAGC
CTGCACATGG CGTCGGATCT GTGGCGGGCC TACGACAAGG TGGCCCGGAC CCACCCGGTA
CTGGCCATGC GGCGGGTGCG CCGGCGGGAT GAGATCTTCC CCGTCTTCCG CGATTTGTTC
ACGCCTGTGC AGAAGGCGAA GGCCGGAGCC TGA
 
Protein sequence
MVNIVDRRLN PKDKSLANRQ RFLRRAKRQV LDAVRDASAK RGVKDMRGEG EQISIPADGL 
GEPSFRKGGD TGVREHVLPG NKEYRVGDVI PRPEGGGGGG GRSEGSPDGE GQDEFQFVVS
RDEFLDLFFE NLALPDLVKK DMKKTERFAN QRAGYSVSGS PSNLNLTRTM RNSLSRRIAL
RRPKREHLRE LEREIDGLER SGKEPQRLRA LIEELERETH RARQIPWIDP IDIRYNRFEP
VPRPVSQAVM FCLMDVSGSM TEDMKDLAKR FFMLLYLFLE RRYRHVDVVF IRHTHIAQEV
DEETFFYSRE TGGTLVSPAL DEMYRVQRDR YPEESWNIYA AQASDGDNTP ADNPRVIKMM
RETILPLTQY FAYIEVGGQS LHMASDLWRA YDKVARTHPV LAMRRVRRRD EIFPVFRDLF
TPVQKAKAGA