Gene Mlg_1145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1145 
Symbol 
ID4269640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1340275 
End bp1342395 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content72% 
IMG OID638125894 
ProductRhs element Vgr protein 
Protein accessionYP_741984 
Protein GI114320301 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATAG ACAACGGACT GTACCTCGCC CTCTCGCGCC CCGGCGCGGC AGAGGCCGAC 
CTGCCCCGGG TCACCGGGTT CACCCTGGAT GAGCACCTCT CCCGGCCCTT CACGCTGACC
CTGGACCTGG TCCACCCCTC GCCCGATCTC GCCCCCGACG ACTGGCTTGA GCAGGGCCTG
GCACTGGCCA TCCATCAGGC CGGCCGCGTC ACCCGCCGGG TCCACGGCGT GGTCACCGAG
TTCCAGCGCG GCCGCACCGG CGCCCGGCGC ACCGCCTACC AACTGGTCGT CCGCCCCGCC
CTCTGGCGCC TTGCCCTGCG CCGCAACGCC CGCATCTTCC AGCACGCCTG CCCGCTGGAT
GTCCTCCACA CCCTGCTTTC GGAACATGGC ATCACCGACG CCGCCTTCGC CGTCCGCCAC
CGCCCCGAGC CCCGCGAGTA CCTGGTGCAG TACCGGGAGA GCGACCTGGC CTTTGTCCAG
CGGCTTGCCG CCGAGTTGGG CATCGTCTAC TTCCACGAGT TCGACGACAC CCCCGAGGGC
GGCCACCGCC CGGTGTTCAC CGATACCCAC CGGGGGCTGG GGCAGGCGGG CGAATGGGCC
TACCGCCCCC GCGCCGGCGG CGTGGCCGAG GCCCGCCATG TGCACACCCT GCGCGAGGCC
CACCGGGTGC GCGCGCAGCG CGCCACCCTG GAGGATCGCC ATTTCCGCAC CCCCCGGCGG
CGGCTGATCC ACGCGCAGGC GGTGGATGGG GCAGCCGGGC ATGATGGCGC CGAGTCGGGG
GCCACCCCCT ACGAGCACTA CGACCACCCC GGCCGGTTCA AGAGCGAGAC CAGCGGCCGG
GCCTTCACCC GGGTCCGGCT CGGCCAGTTG CGCGCCGACG CCCACACCGC CGAGGCCGAA
AGCGATATTG CCGAGCTGCG CCCCGGCGTG CGCTTCACCC TCGATGGCCA CGACGCCGGC
GAACGGCGCC GCGACTGGCA GGTGGTCGGC GCCCTTCATA CCGCCCGGCA GCCCGCCGCG
CTGGAAGAGG ACGCGATCCT GCTGGCCGAC GAGGACGAGG CAGGCGTGGC CCGGCTGAAC
AACCGGCTCA CCCTCGTGCC CGCCGACACC GACTGGCGCC CGCCCCACGA CCCGGGCGCC
GGCCCGCGGA TGGAGGGCCC GCAGATCGCC CGGGTGGTGG GCCCCGAGGG GGAGGCGATC
CATTGCGATG AGCACGGCCG GGTCAAGGTC CGTTTCCCCT GGGACCGCTA CGCCGCCGAC
GACGAGCACG CCAGCGCCTG GCTGCGCGTC GCCCAGCCCT GGGCCGGGCC CGGCTACGGC
GGGCTGTTCC TGCCCCGGGT GGGCCATGCG GTGATTGTCG ACTTCATGGC CGGCGACCCG
GACCAGCCGG TGATCACCGG CCGGGTCTAC GATGGCCACA ACACCCCGCC CTATCCGCTG
CCCGAGCACA AGACCCGCAG CGTGCTGCGC AGTCGCAGCC ACGGCGGCGA GGGCTACAAC
GAACTGCACT TCGAGGACGC CCACGACGCC GAGCGCATCC ACCTGCACGC CCAGCGCGAT
CTCGACCTGC ACACCCGCAA CGACCGCTCC GAGACCATCG GCCGGCACAG CCACCTGGGC
GTCCACGGCG ACCGGCTCGC GGAGATCCAC GGCGACGAGC ACCTCACCGT GCAGGGCGAG
CGGCGCGAGC GCACCGGTGG GGATCAGCAT CTCAGCGTGG AGGGCACGCT CCATCAATAC
CAAGGCGAGT GCCTTTTGGT GGAGGCTGGG CACGAGATCC ACCATGCCGC CGGGGTCAAG
CTGATCCTGG AGGCCGGTGC CGAGATCACC CTCCGGGCCG GCGGCAGCTT CATCAAGCTC
GACCCCTCCG GCATCACCCT CAGCGGCCCC GGCATCCGTA TAAACTCCGG AGGCAGCCCG
GGCTCGGGGA GTGCACAGCG GGCCCAGACG CCCGCACGGC CGGGGCACGT GCCGGCCGAA
CCGCCCCCAC GAGTCGTAAA GGGCGTTGGC CCGGACCCCG AGCGCTATGC ACGCAGCGAG
GCCGCCCGCA TTCAATTGTG TGGCAAGGAC ACGGAGTCCG GTAACTGTTC CAGGGAGGAA
TGCCCGTGCA CGAGTGGCTG A
 
Protein sequence
MPIDNGLYLA LSRPGAAEAD LPRVTGFTLD EHLSRPFTLT LDLVHPSPDL APDDWLEQGL 
ALAIHQAGRV TRRVHGVVTE FQRGRTGARR TAYQLVVRPA LWRLALRRNA RIFQHACPLD
VLHTLLSEHG ITDAAFAVRH RPEPREYLVQ YRESDLAFVQ RLAAELGIVY FHEFDDTPEG
GHRPVFTDTH RGLGQAGEWA YRPRAGGVAE ARHVHTLREA HRVRAQRATL EDRHFRTPRR
RLIHAQAVDG AAGHDGAESG ATPYEHYDHP GRFKSETSGR AFTRVRLGQL RADAHTAEAE
SDIAELRPGV RFTLDGHDAG ERRRDWQVVG ALHTARQPAA LEEDAILLAD EDEAGVARLN
NRLTLVPADT DWRPPHDPGA GPRMEGPQIA RVVGPEGEAI HCDEHGRVKV RFPWDRYAAD
DEHASAWLRV AQPWAGPGYG GLFLPRVGHA VIVDFMAGDP DQPVITGRVY DGHNTPPYPL
PEHKTRSVLR SRSHGGEGYN ELHFEDAHDA ERIHLHAQRD LDLHTRNDRS ETIGRHSHLG
VHGDRLAEIH GDEHLTVQGE RRERTGGDQH LSVEGTLHQY QGECLLVEAG HEIHHAAGVK
LILEAGAEIT LRAGGSFIKL DPSGITLSGP GIRINSGGSP GSGSAQRAQT PARPGHVPAE
PPPRVVKGVG PDPERYARSE AARIQLCGKD TESGNCSREE CPCTSG