Gene Mlg_1254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1254 
Symbol 
ID4269176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1457851 
End bp1459002 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content70% 
IMG OID638126004 
ProductPyrrolo-quinoline quinone 
Protein accessionYP_742093 
Protein GI114320410 
COG category[S] Function unknown 
COG ID[COG1520] FOG: WD40-like repeat 
TIGRFAM ID[TIGR03300] outer membrane assembly lipoprotein YfgL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00763008 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTGT TGAAACGATG CAACCGCCGC GCCCTTGTGG CACTGGCTGC CGTGCTGCTC 
CTTGCCGCCT GCGGTGGGGC GCGGGAACTG CCCGACCTCT CCGACGTGGG CGACGGGGTG
GCGACGGAGA CCCTTTGGAC CGCCTCCACC GGGTCGGGCA GCGCGTCTTC CGCCTACGCC
CTGGTGCCGG CGGTTGAAGG GGGGCGGGTC TATGCCGCTG ACAGCAACGG CCGGGTCACC
GCCTGGGACG CGGAAAGCGG GGAACGGCTT TGGCGGGTGG ATACCGGGCG CGAACTCGCC
GCAGGGCCTG GTGCCGGGGG CGGGTTGGTT CTGGTCGGGG CGCGCGACGG GCGCCTGCTG
GCCCTGGATG CCGAGAATGG CGAGGAGCGG TGGGTCTCCG GCCTGAGCAG CGAGATCCTC
GCGGTACCGC AGATCGCCCG CAACATCGTC GTCGCCCGCA GCGGTGACGG CCGTGTTTAT
GGCCTGGATG GACTGACGGG ACGGCGGCTC TGGATCCACG ATCGCAGCGT TCCGGTGCTG
ACCCTGCGGG GCAGCAGCAG CCCGGTGGTG GTCGGTAACC GGGTGGTGGT CGGACAGGAC
AACGGCCGTC TGGTCACACT CAACCTCCAG GATGGTGAGG TGATCTGGGA GGCGCCCGTC
TCTATACCGC GGGGTCGCTC CGACCTGGAA CGCATGGTGG ACCTGCACGC CGACCCCCTG
GTCTTCCGTG GGGTGGCCTA TGCCCAGGCC TATCAGGGTG AGCTGGCCGC CGTGGGCATG
GGGGACGGGC GGGAGCGCTG GTCCCGTGAT ATCCCCGGCC ATACCGGCAT GGCCGCAGAC
AGTCGCCAGC TCTACGTGGT GGACGACCAG TCCCGGCTGT GGGCCCTGGA CCGTAACAAT
GGCGCCACGG TCTGGCGCCA GGATCGGCTG CAGGGGCTGC GCCTGACCGC CCCGGTGGTG
ATCGGCGGCC ACCTGGTGCT GGCGGACGAG GAGGGTTATC TGAACTGGAT CGCCCCGGAC
AATGGTGATC TGGTGGGGCG GGATCGCCAC GGCCGGCAGC CGATCCAACG GCCGCCGGTT
CCCGATGGCG ATGTCCTGTA CCTGCTGTCG GCCGACGGCC GGCTGGCGGC GCTGAGGCTG
GTGGAGGACT GA
 
Protein sequence
MMLLKRCNRR ALVALAAVLL LAACGGAREL PDLSDVGDGV ATETLWTAST GSGSASSAYA 
LVPAVEGGRV YAADSNGRVT AWDAESGERL WRVDTGRELA AGPGAGGGLV LVGARDGRLL
ALDAENGEER WVSGLSSEIL AVPQIARNIV VARSGDGRVY GLDGLTGRRL WIHDRSVPVL
TLRGSSSPVV VGNRVVVGQD NGRLVTLNLQ DGEVIWEAPV SIPRGRSDLE RMVDLHADPL
VFRGVAYAQA YQGELAAVGM GDGRERWSRD IPGHTGMAAD SRQLYVVDDQ SRLWALDRNN
GATVWRQDRL QGLRLTAPVV IGGHLVLADE EGYLNWIAPD NGDLVGRDRH GRQPIQRPPV
PDGDVLYLLS ADGRLAALRL VED