Gene Mlg_2703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2703 
Symbol 
ID4269947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3068793 
End bp3070124 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content68% 
IMG OID638127464 
Producthypothetical protein 
Protein accessionYP_743533 
Protein GI114321850 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGGAGG GGACCCCCAT GGAGCAGACG TCCGCCGCCC GCCAGCCGTC GGAACAGGCA 
GCCCGGCTGA GCGTCTGGGT GCTACTGCTG CTCGGGGCCC TCCTGCTGCA GCAACCCGTA
CCGAGTGCAC AGGCCGCTGA GAACGGCCCC GCCGGGGCGG ACACCATCAC CCTGCATTTC
TTCTGGACCC AGCAGTGCCC CCGCTGCATT GCCGCCCTGC CCGCCGTCCG CCGGCTGGCC
GAGGACTACG ACTGGCTGGA GGTGCGCAGC TACAACCTCA GCGCGGAGCC ACGGCACGGC
CAGGTCTACC GGGAACTGGC CAGCGCCCTG GGCGAGGAGG CCCGGGCGGT GCCGGGGTTT
GTCTTCTGCG ACGCCATGCT GGTGGGCTTC GATGACCACG GCCGGCAGGA GGCGCGATTG
CGCCAGTTGC TGGAGACCTG TCATGCGCAG ATCCAGGCCG GCGGGCCGCC GGTGCTGGAG
CGTGCCCTTT GGACCGAGGC GGAACCCATG CGGTTGCCGC TGCTCGGCGA GGTGCGGCCG
GATGACCTCT CGCTGCCGGC ACTGACCCTC CTGCTGGCCG GGTTCGACGC CTTCAACCCC
TGCGCCTTCT TCGTGCTGCT GTTCCTGCTC AGCGTGGTGG TGCACAGCCG CAGCCGCGGC
CGTATCCTGC TGATTGGCGG CATCTTCGTG ACCATCTCCG GGGTCATTTA CTTCACCTTC
ATGACCGCCT GGCTGAACGC CTTCCTGGTC TTTGGCGAGA TGCCGCTGGT GACCCGGCTG
GCCGGGCTGG TGGCGGTGAC CATGGCGCTG ATCAACATCA AGGACTACTT CTGGTTCAAG
CGCGGCGTGT CACTGAGCAT TCCGGACTCG GCCCGGCCGG GGCTGTTCCG GCGTATGCGC
GCACTCACCA CCGCCGACAG TCTGCCCTGG GTGCTGGGGG CCACGCTGAT CCTCGCCGTG
GTGGTGAACC TCTACGAGAT CCTCTGCACC ATGGGCTTCC CCATGATCTA CACCCGCATC
CTCACCGCCC ACGACCCGGG CGCGGTGGGA TACTACGGCT ACCTGCTGGC CTACAACGTG
ATCTACGTGC TTCCCATGCT CATCATCGTG GCGCTGTTCG CCTTCACCCT GGGCAATCGC
AAGCTGCAGG AAGACGAGGG GCGGCTGCTG AAACTGCTCT CGGGGATGAT GATGCTGGGT
CTGGGACTGA TGCTGCTGCT GCGTCCGGAC CTGCTGGCCA ACCCCCTGTT CGCCGTCGGC
GTCATCTTCC TGGCCCTGCT GGCCACCGGG CTGGTCCGCC GTCTGTCCCC GCGAGGGTCA
GGCGGTCGAT GA
 
Protein sequence
MQEGTPMEQT SAARQPSEQA ARLSVWVLLL LGALLLQQPV PSAQAAENGP AGADTITLHF 
FWTQQCPRCI AALPAVRRLA EDYDWLEVRS YNLSAEPRHG QVYRELASAL GEEARAVPGF
VFCDAMLVGF DDHGRQEARL RQLLETCHAQ IQAGGPPVLE RALWTEAEPM RLPLLGEVRP
DDLSLPALTL LLAGFDAFNP CAFFVLLFLL SVVVHSRSRG RILLIGGIFV TISGVIYFTF
MTAWLNAFLV FGEMPLVTRL AGLVAVTMAL INIKDYFWFK RGVSLSIPDS ARPGLFRRMR
ALTTADSLPW VLGATLILAV VVNLYEILCT MGFPMIYTRI LTAHDPGAVG YYGYLLAYNV
IYVLPMLIIV ALFAFTLGNR KLQEDEGRLL KLLSGMMMLG LGLMLLLRPD LLANPLFAVG
VIFLALLATG LVRRLSPRGS GGR