Gene Mlg_0101 details

Gene Information

Locus tag: Mlg_0101
Symbol:
ID: 4268839
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 110648
End bp: 111748
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 70%
IMG OID: 638124827
Product: hypothetical protein
Protein accession: YP_740948
Protein GI: 114319265
COG category:
COG ID:
TIGRFAM ID:
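
The start, end, gene length, and protein length fields above are related by simple arithmetic, which the following Python sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length; the record itself states neither. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(110648, 111748)
print(length)                     # 1101
print(protein_length_aa(length))  # 366
```

Under these assumptions the computed values agree with the listed gene length of 1101 bp and protein length of 366 aa.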


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.867237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCCGC GCCCGCTGGA GCACCTGCTG ACCGCACTGA CCGACCCCGC CTCCGTCGGC 
GAACTGCGCC TGGGCGATTG GGACGCCCTG CTGCGGGTGG CCCGGGTCGC CTCACTGGAG
GCACGCCTGC ACGCGCTGTT ACAGGAGCGG GATCTGTTCG ACCGGGTGCC CGCCCGACCG
CGGCGCCACC TGGAGGCGGC GGGCCGGGTA GCCGCCGAAC AACACCACCG GATGCGCTGG
GAGGTGGAAC AGGTCCATGA GGCGCTGGCC GCGCGCAAGG CGCCGGTGGT GATTCTCAAG
GGGGCGGCCT ACCTGATGGC CGGCCTGCCC AGCGCCCGCG GCCGGCTGTT CGCGGACCTG
GACATCATGG TCCCACGCCC GGCACTGGCG ACCACCGAAC ACGTCCTGTT CACCCGCGGC
TGGCTGGCGC AGGGCCACGA CGAATACGAC CAGACCTACT ACCGGCGCTG GATGCATGAA
CTGCCCCCGC TGACCCACAT CCGGCGAAAG AGCGTGCTGG ACGTCCACCA CACCGTCCTG
CCGCCGACGG CGAGATTGCA CCCGGACCCG GACAAGCTAT TCGCCGCCGC CACGCCGCTC
CCGGGCTGGC AAAACCTCTA TGTGCTGGCC CCCACCGACA TGGTGCTGCA CAGCGCGACC
CACCTCTTCC ACGACGGCGA GTTGGAGAAC GGCCTGCGGG ATCTGGTCGA TCTGGATGAC
CTGGTCCGCC ATTTCCACCG CCACGTCGAC GGCTTCTGGC CCGCGCTGGT GGACCGGGCC
CATGAGATGG ACCTGGCGCG CCCGCTGTTC TACGGCCTGC GGTATGCGGC CCACTTTCTG
AACACCCCGG TGCCGGCGAC CGTCAACGAG GGGCTCGCCG CAGCCGGCCC GGGCGTGCCA
CTGCGCCCGC TCATGGACGG GCTGTTCCGG CGCGGCCTCG CGCCCCACCA TTGGCAATGC
GACGATTGGC TTTCCCCGAC CTGCCGATGG ATACTTTACG TAAGGTCGCA CTACCTGCGC
ATGCCCTTGC GCCTGCTGGT ACCCCACCTC ACCCGCAAGG CCATCAAGAG ACGAATGGCG
GCCCCCGAGG CCCACGCCTA G
 
Protein sequence
MMPRPLEHLL TALTDPASVG ELRLGDWDAL LRVARVASLE ARLHALLQER DLFDRVPARP 
RRHLEAAGRV AAEQHHRMRW EVEQVHEALA ARKAPVVILK GAAYLMAGLP SARGRLFADL
DIMVPRPALA TTEHVLFTRG WLAQGHDEYD QTYYRRWMHE LPPLTHIRRK SVLDVHHTVL
PPTARLHPDP DKLFAAATPL PGWQNLYVLA PTDMVLHSAT HLFHDGELEN GLRDLVDLDD
LVRHFHRHVD GFWPALVDRA HEMDLARPLF YGLRYAAHFL NTPVPATVNE GLAAAGPGVP
LRPLMDGLFR RGLAPHHWQC DDWLSPTCRW ILYVRSHYLR MPLRLLVPHL TRKAIKRRMA
APEAHA
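
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that spacing, computes a GC fraction, and translates codons using the codon-to-residue assignments of translation table 11, which are the same as those of the standard genetic code. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first 30 nucleotides of the gene sequence are used as a demonstration.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order. Translation table 11 shares
# these assignments with the standard code (it differs in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the listing.
fragment = clean("ATGATGCCGC GCCCGCTGGA GCACCTGCTG")
print(translate(fragment))    # MMPRPLEHLL
print(gc_fraction(fragment))  # GC fraction of this 30-nt fragment only
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full cleaned gene sequence to the same functions would allow the listed GC content and protein sequence to be checked in the same way.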