Gene Mlg_0838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0838 
Symbol 
ID4270775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp950150 
End bp952243 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content72% 
IMG OID638125590 
ProductRhs element Vgr protein 
Protein accessionYP_741682 
Protein GI114319999 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTACGG ACAACGGACT GTACCTCGCC CTCTCGCGCC CCGGCGTGGC GGAGGCCGAC 
CTGCCCCGGG TCACCGGGTT CACCCTGGAT GAGCACCTCT CCCGGCCCTT CACCCTGACC
CTGGACCTGG TCCACCCCTC ACCCGATCTC GCCCCCGACG ACTGGCTTGA GCAGGGCCTG
GCCCTGGTGA TCCATCAGGC CGGCCGCGTC ACCCGCCGGG TCCACGGCGT GGTCACCGAG
TTCCAGCGCG GCCGCACCGG CGCCCGGCGC ACCGCCTACC AGCTGGTCGT CCGCCCCGCC
CTCTGGCGGC TCTCCCTGCG CCGAAACTGC CGCATCTTCC AGCACGCCTG CCTGCTGGAT
GTCCTCCACA CCCTGCTTTC GGAACATGGC ATCACCGACG CCGCCTTCGC CGTCCGCCAC
CGCCCCGAGA CCCGCGAGTA CCTGGTGCAG TACCGGGAGA GCGACCTGGC CTTTGTCCAG
CGGCTTGCCG CCGAGTTGGG CATCGTCTAC TTCCACGAGT TCGACGACAC CCCCGAGGGC
GGCCACCGCC CGGTGTTCAC CGATACCCAT CGGGGGTTGG GGCATGCGGG CGAATGGGTC
TACCGCCCCC GCGCCGGCGG CGTAGCCGAG GCCCGCCATG TGCACACCCT GCGCGAGGCC
CACCGGGTGC GCGCGCAGCG CGCCACCCTG GAGGATCGCC ATTTCCGCAC CCCCCGGCGG
CGGCTGATCC ACGCCCATGA GGCGGAGGGG GCCGAGTCGG GCGCCACCCC CTACGAGCAC
TACGACCACC CCGGCCGGTT TAAGAGCGAG GCCAGCGGCC GGGCCTTCAC CCGGGTCCGG
CTCGGCCAGT TGCGTGCCGA CGCCCACACC GCCGAGGCCG AGAGCGATAT CGCCGAGCTG
CGCCCCGGCG TGCGCTTCAC CCTGGATGGC CACGACGCCG GCGAACGGCG CCGGGACTGG
CAGGTGGTCG GCGCCCGCCA CACCGCCCGC CAGCCCGCCG CGCTGGAAGA GGACGCGATC
CTGCTGGCCG CCGAGGGGCA GGGCGAGAAC GAGGCAGGCG TGGCCCGGCT GAACAACCGG
CTCACCCTCG TGCCCGCCGA CACCGACTGG CGCCCGCCCC ACGACCCGGA CGCCGGCCCG
CGCATGGAGG GCCCGCAGAT CGCCCGGGTG GTGGGCCCCG AGGGGGAGGC GATCCACTGC
GATGAGCACG GCCGGGTCAA GGTCCGTTTC CCCTGGGACC GCTACGCCGC CGATGACGAG
CACGCCAGCG CCTGGCTGCG CGTCGCCCAG CCCTGGGCCG GGCCCGGCTA CGGCGGGCTG
TTCCTGCCCC GGGTGGGCCA TGCGGTGATC GTCGACTTCA TGGCCGGCGA TCCGGATCAG
CCGGTGATCA CCGGCCGGGT CTACGATGGC CACAACACCC CGCCCTATCC GCTGCCCGAG
CACAAGACCC GCAGCGTGCT GCGCAGCCGC AGCCAGGACG GTGAGGGCTA CAACGAACTG
CACTTCGAGG ATGCCCGCGA GGCCGAGCGC ATCCACCTGC ACGCCCAGCG CGATCTCGAC
CTGCACACCC GCAACGACCG CTCCGAGACC ATCGGCCGGC ACAGCCACCT GGGCGTCCAC
GGCGACCGGC TCGCGGAGAT CCACGGCGAC GAGCACCTCA CCGTGCAGGG CGAGCGGCGC
GAGCGCACCG GTGGGGATCA GCATCTCAGC GTGGAGGGCA CCCTGCATCT CAAAGCCGGT
GAGGCCTGGC TAAGCGAATG CGGCCGGGAA CTGCACGTCA AGTCGGGGCA CAAGGCGGTC
ATCGACGCCG GCGCCGAGAT CACCCTCCAG GCGGGCGGCA GTTTCATCAA GGTCGATCCC
TCGGGCATCA CCCTCAGCGG CCCCGGCATC CGCATGAACT CCGGTGGTCG CCCGGGCTCG
GGATCGGGCC AACGCACGGC AACGCCCCTG TTGCCCGGGC GGGTCATGGC GGCGGAGGCC
GATGGCTCCG CTAAGCCGGG ACCTTCGGCG GTGCTCAAGC AGAGGTTCCT CCTGCACCAG
GCCGCCCAGT CCGGGGCGGG CCTGTGCGAG GTATGTAGCG GCAAGGGAGA ATAG
 
Protein sequence
MPTDNGLYLA LSRPGVAEAD LPRVTGFTLD EHLSRPFTLT LDLVHPSPDL APDDWLEQGL 
ALVIHQAGRV TRRVHGVVTE FQRGRTGARR TAYQLVVRPA LWRLSLRRNC RIFQHACLLD
VLHTLLSEHG ITDAAFAVRH RPETREYLVQ YRESDLAFVQ RLAAELGIVY FHEFDDTPEG
GHRPVFTDTH RGLGHAGEWV YRPRAGGVAE ARHVHTLREA HRVRAQRATL EDRHFRTPRR
RLIHAHEAEG AESGATPYEH YDHPGRFKSE ASGRAFTRVR LGQLRADAHT AEAESDIAEL
RPGVRFTLDG HDAGERRRDW QVVGARHTAR QPAALEEDAI LLAAEGQGEN EAGVARLNNR
LTLVPADTDW RPPHDPDAGP RMEGPQIARV VGPEGEAIHC DEHGRVKVRF PWDRYAADDE
HASAWLRVAQ PWAGPGYGGL FLPRVGHAVI VDFMAGDPDQ PVITGRVYDG HNTPPYPLPE
HKTRSVLRSR SQDGEGYNEL HFEDAREAER IHLHAQRDLD LHTRNDRSET IGRHSHLGVH
GDRLAEIHGD EHLTVQGERR ERTGGDQHLS VEGTLHLKAG EAWLSECGRE LHVKSGHKAV
IDAGAEITLQ AGGSFIKVDP SGITLSGPGI RMNSGGRPGS GSGQRTATPL LPGRVMAAEA
DGSAKPGPSA VLKQRFLLHQ AAQSGAGLCE VCSGKGE