Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0838 |
Symbol | |
ID | 4270775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 950150 |
End bp | 952243 |
Gene Length | 2094 bp |
Protein Length | 697 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638125590 |
Product | Rhs element Vgr protein |
Protein accession | YP_741682 |
Protein GI | 114319999 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTACGG ACAACGGACT GTACCTCGCC CTCTCGCGCC CCGGCGTGGC GGAGGCCGAC CTGCCCCGGG TCACCGGGTT CACCCTGGAT GAGCACCTCT CCCGGCCCTT CACCCTGACC CTGGACCTGG TCCACCCCTC ACCCGATCTC GCCCCCGACG ACTGGCTTGA GCAGGGCCTG GCCCTGGTGA TCCATCAGGC CGGCCGCGTC ACCCGCCGGG TCCACGGCGT GGTCACCGAG TTCCAGCGCG GCCGCACCGG CGCCCGGCGC ACCGCCTACC AGCTGGTCGT CCGCCCCGCC CTCTGGCGGC TCTCCCTGCG CCGAAACTGC CGCATCTTCC AGCACGCCTG CCTGCTGGAT GTCCTCCACA CCCTGCTTTC GGAACATGGC ATCACCGACG CCGCCTTCGC CGTCCGCCAC CGCCCCGAGA CCCGCGAGTA CCTGGTGCAG TACCGGGAGA GCGACCTGGC CTTTGTCCAG CGGCTTGCCG CCGAGTTGGG CATCGTCTAC TTCCACGAGT TCGACGACAC CCCCGAGGGC GGCCACCGCC CGGTGTTCAC CGATACCCAT CGGGGGTTGG GGCATGCGGG CGAATGGGTC TACCGCCCCC GCGCCGGCGG CGTAGCCGAG GCCCGCCATG TGCACACCCT GCGCGAGGCC CACCGGGTGC GCGCGCAGCG CGCCACCCTG GAGGATCGCC ATTTCCGCAC CCCCCGGCGG CGGCTGATCC ACGCCCATGA GGCGGAGGGG GCCGAGTCGG GCGCCACCCC CTACGAGCAC TACGACCACC CCGGCCGGTT TAAGAGCGAG GCCAGCGGCC GGGCCTTCAC CCGGGTCCGG CTCGGCCAGT TGCGTGCCGA CGCCCACACC GCCGAGGCCG AGAGCGATAT CGCCGAGCTG CGCCCCGGCG TGCGCTTCAC CCTGGATGGC CACGACGCCG GCGAACGGCG CCGGGACTGG CAGGTGGTCG GCGCCCGCCA CACCGCCCGC CAGCCCGCCG CGCTGGAAGA GGACGCGATC CTGCTGGCCG CCGAGGGGCA GGGCGAGAAC GAGGCAGGCG TGGCCCGGCT GAACAACCGG CTCACCCTCG TGCCCGCCGA CACCGACTGG CGCCCGCCCC ACGACCCGGA CGCCGGCCCG CGCATGGAGG GCCCGCAGAT CGCCCGGGTG GTGGGCCCCG AGGGGGAGGC GATCCACTGC GATGAGCACG GCCGGGTCAA GGTCCGTTTC CCCTGGGACC GCTACGCCGC CGATGACGAG CACGCCAGCG CCTGGCTGCG CGTCGCCCAG CCCTGGGCCG GGCCCGGCTA CGGCGGGCTG TTCCTGCCCC GGGTGGGCCA TGCGGTGATC GTCGACTTCA TGGCCGGCGA TCCGGATCAG CCGGTGATCA CCGGCCGGGT CTACGATGGC CACAACACCC CGCCCTATCC GCTGCCCGAG CACAAGACCC GCAGCGTGCT GCGCAGCCGC AGCCAGGACG GTGAGGGCTA CAACGAACTG CACTTCGAGG ATGCCCGCGA GGCCGAGCGC ATCCACCTGC ACGCCCAGCG CGATCTCGAC CTGCACACCC GCAACGACCG CTCCGAGACC ATCGGCCGGC ACAGCCACCT GGGCGTCCAC GGCGACCGGC TCGCGGAGAT CCACGGCGAC GAGCACCTCA CCGTGCAGGG CGAGCGGCGC GAGCGCACCG GTGGGGATCA GCATCTCAGC GTGGAGGGCA CCCTGCATCT CAAAGCCGGT GAGGCCTGGC TAAGCGAATG CGGCCGGGAA CTGCACGTCA AGTCGGGGCA CAAGGCGGTC ATCGACGCCG GCGCCGAGAT CACCCTCCAG GCGGGCGGCA GTTTCATCAA GGTCGATCCC TCGGGCATCA CCCTCAGCGG CCCCGGCATC CGCATGAACT CCGGTGGTCG CCCGGGCTCG GGATCGGGCC AACGCACGGC AACGCCCCTG TTGCCCGGGC GGGTCATGGC GGCGGAGGCC GATGGCTCCG CTAAGCCGGG ACCTTCGGCG GTGCTCAAGC AGAGGTTCCT CCTGCACCAG GCCGCCCAGT CCGGGGCGGG CCTGTGCGAG GTATGTAGCG GCAAGGGAGA ATAG
|
Protein sequence | MPTDNGLYLA LSRPGVAEAD LPRVTGFTLD EHLSRPFTLT LDLVHPSPDL APDDWLEQGL ALVIHQAGRV TRRVHGVVTE FQRGRTGARR TAYQLVVRPA LWRLSLRRNC RIFQHACLLD VLHTLLSEHG ITDAAFAVRH RPETREYLVQ YRESDLAFVQ RLAAELGIVY FHEFDDTPEG GHRPVFTDTH RGLGHAGEWV YRPRAGGVAE ARHVHTLREA HRVRAQRATL EDRHFRTPRR RLIHAHEAEG AESGATPYEH YDHPGRFKSE ASGRAFTRVR LGQLRADAHT AEAESDIAEL RPGVRFTLDG HDAGERRRDW QVVGARHTAR QPAALEEDAI LLAAEGQGEN EAGVARLNNR LTLVPADTDW RPPHDPDAGP RMEGPQIARV VGPEGEAIHC DEHGRVKVRF PWDRYAADDE HASAWLRVAQ PWAGPGYGGL FLPRVGHAVI VDFMAGDPDQ PVITGRVYDG HNTPPYPLPE HKTRSVLRSR SQDGEGYNEL HFEDAREAER IHLHAQRDLD LHTRNDRSET IGRHSHLGVH GDRLAEIHGD EHLTVQGERR ERTGGDQHLS VEGTLHLKAG EAWLSECGRE LHVKSGHKAV IDAGAEITLQ AGGSFIKVDP SGITLSGPGI RMNSGGRPGS GSGQRTATPL LPGRVMAAEA DGSAKPGPSA VLKQRFLLHQ AAQSGAGLCE VCSGKGE
|
| |