Gene Mlg_1725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1725 
Symbol 
ID4268974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1973826 
End bp1975019 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content68% 
IMG OID638126483 
Producthypothetical protein 
Protein accessionYP_742561 
Protein GI114320878 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.685186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCTC GCTCCCCATA CCCGCATTAC GATGTCATCG TGATCGGCGC CGGCGCCGCC 
GGGCTGATGT GCGCGCTCAC CGCCGGCGGG CGTGGCCGCC GGGTGCTGGT GCTGGACCAT
GCCAACAAGG TGGGCAAGAA GATCCTGATG TCCGGCGGTG GGCGGTGCAA TTTCACCAAC
ATCCACTGCG GGCCCGAGCA CTTTCTGTCG GCCAACCCGC ACTTCGTCAA ATCGGCCCTC
AGTCGCTACA CCCCCTGGCA CTTCATCGCC TTGGTGGAAC AGCACGGCAT CCCCTACCAC
GAAAAGAAGC TGGGCCAGCT CTTCTGCGAC CGCTCCTCCA AGGATATCGT CGGCATGCTG
CTGGCGGAGT GCCGGGCGGT GGGGGTGGCG ATCCGTACCC GCTCGCCGGT CAGCGACCTG
CGGCTGGGGG CACCGCACTG GCTGTCCACT CCGCAGGGCC CGGTGACGTG TTCATCGCTG
GTGATTGCCA CCGGCGGCTA CTCGATTCCG AAGATGGGTG CCACCGGCTT CGGCTTCGAC
CTGGCGCGGT CGCTGGGGAT TCCGGTACGG CCCACCCGTG CGGCGCTGGT GCCGGTGACC
CTGGAGGGGC GCAAGCGGCG CCAGCTGCAG GACCTGGCCG GGGTGGCCCT GGACAGCGTC
ACCCGCGCCG GCGGGGCCGC CTTCCGCGAG AATATCCTGT TCACCCACCG TGGGCTCAGT
GGGCCGGCGA TCCTCCAGGC CTCGTCCTAC TGGCAGCCCG GCGAGCCGCT GGAGATCGAC
CTGTTTCCCG ACACGGATCT GGCCGGGCAC CTGGAGGCGA TGCGCCGGGA GCGCCCGCGT
CTGACCCTGA AGAAGCTGCT GGGCGAGCAA CTCACCCGCC GTGTGGCCCA GCGCTGGTGT
GAACTCTGGC TGCCGGACAG GCGCCTGGAG CAGTTGACCG GCGAGGACAT ACGCCGGATC
CAGCAGGCCT GCCAGCCCTG GACGGTCTGG CCCGATGGCA CCGAAGGGTA CCGTACCGCC
GAGGTGACCC TGGGCGGGGT CGACACCCAT GCGCTATCTT CCAAGACCAT GGCCTGTCGT
GATCACCCGG GGCTCTACTT CATCGGTGAG GTGGTGGATG TCACGGGCCA CCTGGGTGGT
CATAACTTTC AGTGGGCCTG GGCGTCGGGG CATGCGGCGG GGCAGCATGT GTAG
 
Protein sequence
MASRSPYPHY DVIVIGAGAA GLMCALTAGG RGRRVLVLDH ANKVGKKILM SGGGRCNFTN 
IHCGPEHFLS ANPHFVKSAL SRYTPWHFIA LVEQHGIPYH EKKLGQLFCD RSSKDIVGML
LAECRAVGVA IRTRSPVSDL RLGAPHWLST PQGPVTCSSL VIATGGYSIP KMGATGFGFD
LARSLGIPVR PTRAALVPVT LEGRKRRQLQ DLAGVALDSV TRAGGAAFRE NILFTHRGLS
GPAILQASSY WQPGEPLEID LFPDTDLAGH LEAMRRERPR LTLKKLLGEQ LTRRVAQRWC
ELWLPDRRLE QLTGEDIRRI QQACQPWTVW PDGTEGYRTA EVTLGGVDTH ALSSKTMACR
DHPGLYFIGE VVDVTGHLGG HNFQWAWASG HAAGQHV