Gene Mlg_1728 details


Gene Information

Locus tag: Mlg_1728
Symbol:
ID: 4268977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1977731
End bp: 1979158
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 71%
IMG OID: 638126486
Product: hypothetical protein
Protein accession: YP_742564
Protein GI: 114320881
COG category: [S] Function unknown
COG ID: [COG1690] Uncharacterized conserved protein
TIGRFAM ID:
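
The start, end, and length fields in this record agree with one another, which a short calculation can confirm. The Python sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The variable names are hypothetical and are not fields of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 1977731
end_bp = 1979158

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1428
print(protein_length)  # 475
```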


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACCT CACGTTTGAC CCGCATCGAC GAGACGAGCT GGCAGATTGA GCCCAGTGGC 
GCCATGCGGG TGCCCGGCAT CATCTACGGT GACGAGACGC TGATCCGGGC CATGGACGAC
AAGGTGGCCG AGCAGACCGC CAACGTCGCC TCGCTGCCCG GTATCGTCAA GGGCGCCTAC
GTGATGCCGG ACGCCCACTG GGGCTATGGC TTTCCCATCG GCGGCGTGGC CGCCTTCGAC
CCGAATGAGG GCGGGGTGGT GTCCGCCGGC GGCGTGGGCT TCGATGTCTC CTGCGGGGTG
CGCACCCTGC TCACCGGGCT GACGGTGCCG GAGGTGCGCC GTGTGCAGGA GCGCCTGGCC
GATGCCCTGA TGTCGAGCAT CCCCGCCGGG GTCGGCAGCC ACAGCGGCAT CACCCTGCAG
GGCCGGGAGA TGGATGCCAT GCTGCGCGGC GGCGCCGCCT GGGCGGTGGA GAAGGGATGG
GGCGGGGCCG AGGACCTGGC GCGCATTGAG GAGCGGGGCC GGATGGAGGG GGCCGATCCC
GACTGCGTGT CCGAGCGGGC CAAGAAGCGC CAGCGCCGGG AGATGGGCAC TCTGGGCTCG
GGCAACCACT ACCTGGAGGT GCAGGCGGTG GAGACCCTCT ACGATCCGGA TACCGCGGCC
GTGCTCGGCC TGGCCGAGGG CGATGTGGTG GTGACCATTC ACTGCGGCTC GCGCGGTCTG
GGCCATCAGA TCGGCACCGA GTTCCTGCGT GACATGCTGC CCGCCGCCGC CGAGGCTGGC
ATCCACCTGC CGGACCGGGA ACTGGCCTGT GCCCCCATCC ACTCGCCCAT CGGCGAGCGC
TACCTGGGGG CCATGCGCAG CGCCATTAAC TGCGCCCTGG CCAACCGGCA GATCCTGGGG
GAGTTCGCCC GCGAGGTCTT CGCCCGTTTC TTCCCCGACC ATCCGCTGGA CCTGCTCTAC
GACGTCTCGC ACAACACCTG CAAGGTCGAG ACCCATACCG TGGATGGCCG GCCGCGCCGG
CTTTTCGTCC ACCGCAAGGG CGCCACCCGG GCCTTTGGCC CGGCCCACGC CGATCTGCCC
GAGGCGCTGC GTGGGGTCGG TCAGCCGGTG CTGATCGGCG GCAGCATGGG CACCGGCTCG
CACATCCTGG TGGGCACGGG GGCCGGCGAC AAGGCCTTCT CCTCCGCCTG CCACGGTGCC
GGCCGGGCCA TGTCCCGGCG CGCGGCGCTG AAGCGCTGGC GCGGCCGCCA GGTGGTGGAC
GAACTGGCCG AACGCGGGAT CCTGATCCGC AGTCCCTCGA TGCGCGGTGT GGCGGAGGAG
GCCCCGGGCG CCTACAAGGA TGTGGACCAG GTGGTGATCG CCGCCGAACG GGCGGGGCTG
GCGCGCCGGG TGGCGCACCT GCGCCCGCTG ATCTGCGTCA AGGGGTGA
 
Protein sequence
MDTSRLTRID ETSWQIEPSG AMRVPGIIYG DETLIRAMDD KVAEQTANVA SLPGIVKGAY 
VMPDAHWGYG FPIGGVAAFD PNEGGVVSAG GVGFDVSCGV RTLLTGLTVP EVRRVQERLA
DALMSSIPAG VGSHSGITLQ GREMDAMLRG GAAWAVEKGW GGAEDLARIE ERGRMEGADP
DCVSERAKKR QRREMGTLGS GNHYLEVQAV ETLYDPDTAA VLGLAEGDVV VTIHCGSRGL
GHQIGTEFLR DMLPAAAEAG IHLPDRELAC APIHSPIGER YLGAMRSAIN CALANRQILG
EFAREVFARF FPDHPLDLLY DVSHNTCKVE THTVDGRPRR LFVHRKGATR AFGPAHADLP
EALRGVGQPV LIGGSMGTGS HILVGTGAGD KAFSSACHGA GRAMSRRAAL KRWRGRQVVD
ELAERGILIR SPSMRGVAEE APGAYKDVDQ VVIAAERAGL ARRVAHLRPL ICVKG
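
The relationship between the gene sequence and the protein sequence can be sketched in Python. The example below is a minimal illustration, not the pipeline that produced this record. It builds the codon-to-amino-acid mapping shared by the standard genetic code and NCBI translation table 11 (the two differ in permitted start codons, not in amino acid assignments), translates the first 30 bases of the gene sequence, and computes a GC fraction. The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as laid out in the record.
prefix = "ATGGATACCT CACGTTTGAC CCGCATCGAC"
print(translate(prefix))  # MDTSRLTRID
print(round(gc_fraction(prefix), 3))
```

Passing the full gene sequence to the same helpers should reproduce the protein sequence and approximate the GC content reported in the Gene Information table.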