Gene Mlg_0367 details

Gene Information

Locus tag: Mlg_0367
Symbol: rho
ID: 4268958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 413143
End bp: 414399
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 63%
IMG OID: 638125098
Product: transcription termination factor Rho
Protein accession: YP_741212
Protein GI: 114319529
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
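
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 413143
END_BP = 414399
GENE_LENGTH_BP = 1257
PROTEIN_LENGTH_AA = 418


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 414399 - 413143 + 1 = 1257 bp, and 1257 / 3 - 1 = 418 aa.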


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.151601
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAACCTGA CGGAACTAAA ACAGAAACCC GCCGCTGAAC TGATGAAACT GGCCCAGGAA 
CTGGGCGTGG AGGGGACGGC CCGCTCGCGC AAGCAGGACG TCATCTTTTC CATACTCAAG
TCGCAGGCCA AAGGTGGCGA ACCCATCTAC GGCGATGGCG TGTTGGAGAT CCTGCAGGAT
GGCTTCGGCT TTCTGAGGTC TGCCGACAGT TCATACATGG CGGGGCCGGA CGATATCTAC
GTCTCGCCGA GCCAGATCCG GCGCTTCGCG CTGCGCACCG GCGACACCAT CGCGGGCAAG
ATCCGCCCGC CCAAGGATGG TGAGCGCTAC TTCGCGCTGC TCAAGGTCAA TGAGATCAAC
TTCGAGCCGC CGGAGAACGC CAAGCACAAG GTGCTGTTCG AGAACCTCAC GCCGCTGCAC
GCCACCGAGC GTATGCGCAT GGAGCGCGGC AACGGCTCCA CGGAAGATCT CACCGCGCGG
ATCATCGACC TGATCGCCCC CATCGGTAAG GGCCAGCGCG GCCTGCTGGT TTCGCCGCCC
AAGGCGGGTA AGACGATGCT GCTGCAGCAC ATCGCGCAGA GCATCACCGC CAACCACCCC
GAGACCTATC TGATCGTGCT GCTGATCGAC GAGCGCCCGG AGGAGGTCAC GGAGATGCAG
CGCTCGGTGC GCGGCGAGGT GGTCTCCTCG ACCTTCGACG AGCCGGCCAG CCGCCACGTG
CAGGTGGCGG AGATGGTGAT CGAGAAGGCC AAGCGGCTGG TGGAGCACAA GAAGGATGTG
GTGATCCTGC TGGATTCCAT CACCCGTCTG GCCCGGGCCT ATAACACGGT GGTGCCTTCG
TCGGGCAAGG TGCTGACCGG CGGTGTGGAC GCCAATGCCC TGCACCGGCC GAAGCGTTTC
TTCGGTGCGG CGCGCAATGT GGAGGAAGGC GGCAGTCTGT CGATCATCGC CACGTCGCTG
GTGGACACCG GCTCGCGGAT GGATGAGGTG ATTTACGAGG AGTTCAAGGG CACCGGCAAT
ATGGAGCTGC ACTTGGACCG GCGCATCTCC GAGAAGCGGA TCTACCCGGC GGTGAACATC
AACCGTTCGG GCACCCGCCG CGAGGAGTTG CTGATGAAGC CGGACGAGTT GCAGAAGGTG
TGGATCCTGC GCAAGCTGCT GCACCCGATG GACGACCTGG CCGCCATCGA GTTCCTGCTG
GACAAGCTGA AGGACACCAA GACCAACGGC GAGTTCTTTG ACTCGATGAA GCGGTGA
 
Protein sequence
MNLTELKQKP AAELMKLAQE LGVEGTARSR KQDVIFSILK SQAKGGEPIY GDGVLEILQD 
GFGFLRSADS SYMAGPDDIY VSPSQIRRFA LRTGDTIAGK IRPPKDGERY FALLKVNEIN
FEPPENAKHK VLFENLTPLH ATERMRMERG NGSTEDLTAR IIDLIAPIGK GQRGLLVSPP
KAGKTMLLQH IAQSITANHP ETYLIVLLID ERPEEVTEMQ RSVRGEVVSS TFDEPASRHV
QVAEMVIEKA KRLVEHKKDV VILLDSITRL ARAYNTVVPS SGKVLTGGVD ANALHRPKRF
FGAARNVEEG GSLSIIATSL VDTGSRMDEV IYEEFKGTGN MELHLDRRIS EKRIYPAVNI
NRSGTRREEL LMKPDELQKV WILRKLLHPM DDLAAIEFLL DKLKDTKTNG EFFDSMKR
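
As a rough illustration of how the two sequences relate, the sketch below translates short excerpts of the gene sequence and compares them with the matching parts of the protein sequence. It uses only the Python standard library. The codon table is written out by hand, on the assumption that the amino-acid assignments of translation table 11 match the standard code (the tables differ in permitted start codons, which this sketch does not model). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the gene and first 20 aa of the protein, copied from above.
gene_start = "ATGAACCTGA CGGAACTAAA ACAGAAACCC GCCGCTGAAC TGATGAAACT GGCCCAGGAA"
protein_start = "MNLTELKQKP AAELMKLAQE"
print(translate(gene_start) == clean(protein_start))  # True

# Last 57 nt of the gene (in frame) and last 18 aa of the protein.
gene_end = "GACAAGCTGA AGGACACCAA GACCAACGGC GAGTTCTTTG ACTCGATGAA GCGGTGA"
protein_end = "DKLKDTKTNG EFFDSMKR"
print(translate(gene_end) == clean(protein_end))  # True
```

The same `translate` function can be applied to the full gene sequence after pasting it into a string; a library such as Biopython would be a more robust choice for real work.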