Gene Mlg_2400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2400 
Symbol 
ID4269987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2726255 
End bp2728150 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content67% 
IMG OID638127158 
Producthypothetical protein 
Protein accessionYP_743230 
Protein GI114321547 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.000233655 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACTAA TACGTTCGTA TTGGGCCGCA TCATTTCATC CCCATTCACG TCCTTCGTTA 
CCGTTTTTTT CATTGCTGAC TTTCGTCATT CTGGGGCTGG CCCTGGCCGG GTGCAGCAGC
TCCAGCTCAT CCGGTGACCC ATCGGATGAG GAGGATAACG GCTTTTCCGT CAGTGTGAAC
GTCCGCGGGT TGGACTTGGC CATTGATCCC GGGACACCGC TGATACTGAA CGACCAGGCC
GAGGAGGCGC TGACCATTGA GGCGGATGGC AGCTATGCAT TCTCCGCCGA GCTGGCGTCC
GGCGACAGCT ATGGGGTCGA GGTGACGCAG CAGCCCAACG AACCGCGGCA ATTCTGCTTC
ATCGAGCGTC AGCAGGCCGT CTCCGGCGAG ATCACCGGCG ATGTGACCCT CAACGTGGAC
TGCGGCCTGG CCTACCACGG GCTGTTCCCC GGCTGGGGGG TGGGCGCGAC TCCGGACGAC
GGCTCGCTGC GGCCCCTGCA GATCGCCCCC GCCACGCTGG TGGCCCTGCG GGTCGACGGC
CAGGACGAGA TTGCCGTTGT CGATCCCTGC CAGGTGGAGA CCCGTGGTGA GGTGACCCAC
GCGGACGGTA CCTACGACCT GTCAGTGACC GCGGCCACCG CCTGTGACCC CACCCTGTTC
CTGAACACCA CCATGACGGA GGAGGCGTTC ACGGCGCAGA TCCCGGAGGG CATCGACGAG
TTGCCCGACG AGCTCGAGCT GAGCCGTTTC GCCGGCCCCG ACCGCGGCCT GCCCACCGCG
GCGCTGGGGC ATGCGGTGGC GGATGCCGAG GCGCAGGATC TGCTCCAGCA CAACCGCTGG
CTGCTCGCCG GCCTCACCGC CCGCCAGGCG GACGCCATGG CCGCCGAGGA GCTGCACGGC
ACGTGGGGCC TGGTGGCGCT CACCATGGAG CTGAACGACG AGGCTGAGCC CCAGTGGGTC
CGCCACAGCA GCCTCAGTGT GCCGGCGATC ATCGGCAACG ACGGCAACGG AGATCTGCTC
CTGAACGCGG ACGGCTGGCA GACGCTGACC GCCGCCGTGC AGGCCATTCA GGACGACCCG
CCGCCTCCGC CGATCACCCA GGGCTTCACC CAGGGGGCAC CCGGCGATGG CTTCAGCGAC
GTTCCGTTGA CCCTGAGCGC AGATGGCCGC TTGCAGGCCG GGGACAGCCA CGGTTTCGTC
TCCGCCGATG GCGACTTCTT CGTGCTGATC CATTCCGAAC CCACCCCGGA ACAGGTCCGG
GCGGACGAGG AGAACGGGCT GGGTGAGGGC CAGGGCGCCC ACCAGGTGCT GCTGGGCGTG
CGGCGCGACG AGAACCTGGT CTCGCTGGAC GGGCGGACTT ACGCCCTGTT CGGCCCCTCC
TGGTTCCTGG GCACGGAGGA GGACGGTGAC TCGCTGCAGG GCGTGGCCGA GTTCGAACTG
GCGCCCTTCC TGGAGGGCAC GGCACTGAGC TTCTCCGAGG GCGAGGTGAC GCTGACCCTG
GAGGAGGAGC AATGGATCGC GCCCTTCGGC GGCGGTGCCC TGGACATCGA CAGCGACTCC
GACTCACTCG CGGGACTGCC CTATACCATT GGTGACAATG AAGGCGACAA GCCGCAACTG
ATCGAGATCG ACCTGGGCGA TGGCTTCGGG CCCGATCGTG ACAACTACCT CACCGGCTAC
GCCCACGGCA AGCTGCTGGT ACTCGCCCTG GGCGTCCGCG ACGGCACCCA GGAGGAGGAG
GCGGCGAGTC TCAGTGCCGA GATCAACCTG CTCGGCCTCA TCACCCTGAG TGCCGACGCC
ACGGTGAACG GCGACGATGA CGCCATTCCG GCGGAGGACG TGAGTGTGGG CACGCTGATC
GGCATCTGCG TGCAGGGCTG CAACAACGAG TTCTGA
 
Protein sequence
MKLIRSYWAA SFHPHSRPSL PFFSLLTFVI LGLALAGCSS SSSSGDPSDE EDNGFSVSVN 
VRGLDLAIDP GTPLILNDQA EEALTIEADG SYAFSAELAS GDSYGVEVTQ QPNEPRQFCF
IERQQAVSGE ITGDVTLNVD CGLAYHGLFP GWGVGATPDD GSLRPLQIAP ATLVALRVDG
QDEIAVVDPC QVETRGEVTH ADGTYDLSVT AATACDPTLF LNTTMTEEAF TAQIPEGIDE
LPDELELSRF AGPDRGLPTA ALGHAVADAE AQDLLQHNRW LLAGLTARQA DAMAAEELHG
TWGLVALTME LNDEAEPQWV RHSSLSVPAI IGNDGNGDLL LNADGWQTLT AAVQAIQDDP
PPPPITQGFT QGAPGDGFSD VPLTLSADGR LQAGDSHGFV SADGDFFVLI HSEPTPEQVR
ADEENGLGEG QGAHQVLLGV RRDENLVSLD GRTYALFGPS WFLGTEEDGD SLQGVAEFEL
APFLEGTALS FSEGEVTLTL EEEQWIAPFG GGALDIDSDS DSLAGLPYTI GDNEGDKPQL
IEIDLGDGFG PDRDNYLTGY AHGKLLVLAL GVRDGTQEEE AASLSAEINL LGLITLSADA
TVNGDDDAIP AEDVSVGTLI GICVQGCNNE F