Gene Information |
Locus tag | Csal_2254 |
Symbol | |
ID | 4026064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2531081 |
End bp | 2533708 |
Gene Length | 2628 bp |
Protein Length | 875 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637967458 |
Product | Rhs element Vgr protein |
Protein accession | YP_574303 |
Protein GI | 92114375 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria; [COG4253] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein; [TIGR03361] type VI secretion system Vgr family protein |
|
|
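The coordinate and length fields above agree with each other, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Csal_2254.
length_bp = gene_length(2531081, 2533708)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2628 875
```

The strand does not enter this calculation: the length is the same whether the gene is read from the plus or the minus strand.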
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAGA CGATGCAGGA TCGTTTACTG GATAGCTTCG ATATCGATCT TTCGCAGTCG CAGCGGTTGC TGACCCTCGA GGTCGCCGGG ACGGCGCTGC TGCCGCATCG CCTGGTGGGG GAGGAGCGCG TGAGTGCCCC CTTCACCTAC ACCCTCGACT GTATCTCCCA GCAGAACGAT CTCGAGCTCA AGACATTGCT GGCCCAGCCG GCCCGGCTTT CGATCCTGCA GGCCGACGGC AGCTATCGCC CGCTGCACGG GCTGGTCTCC GAGGCCGCGC TGCTGGGCGA GGATGGCGGC GTCACCTATT ACCAGCTCAC CCTGGTGCCG TGGCTCAAGA TGCTCGCGCT GGGCCGCGAC AGCCGCATCT TCCAGGACCG CAGCGTCGTT GATGTCCTCA CCCAGGTATT CGAGGGGCAT GCACTGGCCA GGGGCCGCTA CCGCTTCGAC CTGCGCCGCG ACTACCCCGC GCGCAGCTAC TGCGTGCAGT ACCGCGAGAG CGACCTGAAC TTCATCAGCC GCCTCTGCGA GGAAGAAGGC CTCTTCTACT ACACCGAATT CGCGGGCGAT GACGATGACT TCGACGGCCA CCGCATCGTC TTTACCGACG ATGTCGATAC CACCCAGCCC GTCAGCCCCC AGGCGATTCG CTTTCATCGT CAGGATGCCA CCGAGACCGA GGACGCCCTC AACCAGTGGG GCGGCGTGCG TCAGCAGCAG CCCACCCGGG TCAGCGTGGG CACCTTCGAC TACAAGCAGC CCTCGCTCAC CAAGCGCACC GGACTGGATA CTCTCAGCGA TCAGGGCAAC CTGCCGCCGA CCGAGGTCTA CGACTACGCC GGCGAGTACT ACTACCACGG CTTCGAGCGC GGCGAGCGCC TCACCGAGAA CCGCCTCGAG GCCCATGAAT CGCTGGCCAA GCGCTTTCGC GGCAGCGGCG GTGCCCGCCA GCTCCAGGCC GGGCGCTGGT TCGAACTCAC CCAGCACCCG CTGCACGACC GGGGCGGCGA GGAAGAACGC CAGTTCCTGC TGCTCGGCGT CACCGTCCAC GCCGAGAACG CGCTGCCGGT CTCGACCCAG CACCAGGCGC TGCCCGGCAG CCTGCAGCCG CAGCTCGACG CCGCCAAGCA GGCGCATGGG CTGGAGAGTG ATACCGCTGA TCCCACCGGC GAGCGGCTAT CGGATTACGC CACCGGCGGC ACCGGCCACT TCCTGGTCGA TCTCGAGGCC CAGCGGCGCA GCCAGCCCTA TCGCCACCCG CTGACCCATC GCCGCCCGGT GATCGGCGGG CCCCAGACCG CCACCGTGGT CGGCCCCGCC AACGAGGAGA TCCACACCGA TGCCCTCAAC CGCGTGCGGG TGCAGTTCCA CTGGGACCGC CAGGGCCAGC ATGACGAGAA CTCCAGCGTC TGGCTGCGGG TCTCCCAGCC CAACGCCGGT GCCGGCTGGG GCGGCGTGTT CGTGCCGCGC ATCGGCCAGG AAGTCCTCGT CGACTTCCTC GAGGGCGACG CCGACCGGCC GCTGATCACC GGCCGGGTCT ACAACGGCGA ACAGAGCCCC GACTGGCACA GCCACGGCCT GCTCTCGGGC TTCAAGAGCA AGACCTATCG CGGCAGCAAG TACAACGAAC TCGTCTTCGA CGACGCCACC GACCAGGAGC GCGTTCGTCT CAGTTCCGAA GCCGAGAAGA GCCAGCTCAA CCTCGGCTAC CTGATCCACC AGACCGGCAA CACCCGGGGC GCCTTCCGCG GCACCGGGTT CGAGCTGCGC AGCGACGCCT ATGGCGCTAT CCGTGCCAAC 
CAGGGGCTCT ATCTCACCAG CTGGGGCCAG CTCGGCGCCA GCGGTGACCA GCTCGACCTC ACCCCGGCCA AGCAGCAGCT CGACAGCGCC TATAACCTCA CCGACAGCCT CAGCCAGAGT GCGGCGGACC ACAACGCCGA GGCCCTCGAC AGTCGTACAC ACCTCAAGCA GGCGGGCGAG GACGCCGACG ACACCTACGG CACCAGCGAG CAGCTCGTCG ATTCACAAGG TCAGGCCAAT CAGGACAGCG CCCGTGGCGC CACCGACAGC GGCGGCCGCG GTGAAGCGGC GCGCATGAAA GCCCCCTGGC TGCACCTGGC CTCGCCCGCC GGCATCACGC TGAGCACCCC GGAGTCGACC CATCTGGCCC AGGGCCGCTC GCTGAGTGTT TCCAGCGGCG AGGACGTCAA CGTGGCTACC GGCAAGAGTC TGGTCGCCTC CATTTCCGAG AGGCTCTCGC TGTTCGTGCA GAAGGCTGGC ATCAAGCTGT TCTCGGCTCG GGGCAAGGTG GAGATGCAGG CGCAGTCGGA CAACCTGGAA GCCACCGCGC GGCGGGATCT CGGCGTGACC TCCACGGAAG ACAAGATCGA CTTCGCCGCC GCCGAAGAGA TTTTGCTGGT CAGCGGCGGC GCCTACGTGC GCATCAAGGG CGGCGATATC GAGCTGCACG CCCCCGGCAA GGTCACCGTC AAGGGCGCGA CGAAGAACTT CGGCGGGCCG GCGAGCCTGA ACTCGCAGAT GCCTTATTTG CCGGAGGAGG CGAGCCCTCT TTGTCCCAAG AAGAACCGAG CTGCAGCCAG CGTGGGCGAG GGAAGCGTGG AGATATGA
|
Protein sequence | MEETMQDRLL DSFDIDLSQS QRLLTLEVAG TALLPHRLVG EERVSAPFTY TLDCISQQND LELKTLLAQP ARLSILQADG SYRPLHGLVS EAALLGEDGG VTYYQLTLVP WLKMLALGRD SRIFQDRSVV DVLTQVFEGH ALARGRYRFD LRRDYPARSY CVQYRESDLN FISRLCEEEG LFYYTEFAGD DDDFDGHRIV FTDDVDTTQP VSPQAIRFHR QDATETEDAL NQWGGVRQQQ PTRVSVGTFD YKQPSLTKRT GLDTLSDQGN LPPTEVYDYA GEYYYHGFER GERLTENRLE AHESLAKRFR GSGGARQLQA GRWFELTQHP LHDRGGEEER QFLLLGVTVH AENALPVSTQ HQALPGSLQP QLDAAKQAHG LESDTADPTG ERLSDYATGG TGHFLVDLEA QRRSQPYRHP LTHRRPVIGG PQTATVVGPA NEEIHTDALN RVRVQFHWDR QGQHDENSSV WLRVSQPNAG AGWGGVFVPR IGQEVLVDFL EGDADRPLIT GRVYNGEQSP DWHSHGLLSG FKSKTYRGSK YNELVFDDAT DQERVRLSSE AEKSQLNLGY LIHQTGNTRG AFRGTGFELR SDAYGAIRAN QGLYLTSWGQ LGASGDQLDL TPAKQQLDSA YNLTDSLSQS AADHNAEALD SRTHLKQAGE DADDTYGTSE QLVDSQGQAN QDSARGATDS GGRGEAARMK APWLHLASPA GITLSTPEST HLAQGRSLSV SSGEDVNVAT GKSLVASISE RLSLFVQKAG IKLFSARGKV EMQAQSDNLE ATARRDLGVT STEDKIDFAA AEEILLVSGG AYVRIKGGDI ELHAPGKVTV KGATKNFGGP ASLNSQMPYL PEEASPLCPK KNRAAASVGE GSVEI
|
| |
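The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA, so no reverse complement is applied despite the minus strand) and that the spaces between 10-base groups are only display formatting. Internal codons in translation table 11 are read the same way as in the standard code, so the sketch uses the standard codon assignments; the helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which table 11 shares for codons inside the reading frame).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 21 bases of the gene sequence in this record.
print(translate("ATGGAAGAGA CGATGCAGGA T"))  # MEETMQD
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and GC content fields of the record.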