Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_1039 |
Symbol | |
ID | 5134093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | - |
Start bp | 1018862 |
End bp | 1020061 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640531361 |
Product | hypothetical protein |
Protein accession | YP_001215875 |
Protein GI | 147671782 |
COG category | [S] Function unknown |
COG ID | [COG3299] Uncharacterized homolog of phage Mu protein gp47 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAA GACCGCAGGC CGACTTTGTC GAAATACTCT CAGAATCGGG CGTGCCAGTT ACTGAGGATG CCTTCGAGGC CGCGCTCAAA GCAGACGTGA CAGAGTCAGG AAGCCTCTTG TCCAACGATT CGCAAATGTC ACCCTTCTGG CGTTGGGTTC GTGCTGCTGT TGTGACACCT GCCGTGTGGC TGATCCGCAC ACTGCTCGCA GGGCATGTCA TGCCCAATAT CTTTGTGGGT ACGGCGGAAC GTTGGGCGCT AGAGCTAAAA GCATGGGAAT ACAACGTCAC GCCCAAAGGC GCAGTGAGCA CCCAAGGCTT AATCACCTTC ACCAAAGCCA ACGCCGCAGA TGAAACCAGT ATCGAAGCAG GAACCATCAT TCAAACGCCA GAGATTGAAG GCAAGGTGTA CAAACTTACC GCAATAAAAA CCACGGTGAT CAAAGCGGGG CAAGCCTCCG GCAAAGTCTT GTGTGAAGCC AGTGAAGCGG GAGCCGCTTA CAACCTGCCC GCCGGCTATT TCAGCATTCT GCCGCAGGGC GTATCGGGCA TTGTCTCTGT CACCAATGAA GCGAATTGGA TAACCCAACT CGGCGCAGAC CAAGAAAGCG ACGAAGAATT AGCCCTACGC CTACAAAACG CCTTTACCAG TGCGGGCGAA TGGCACATTG ACGATGTTTA CCGCGCCATG ATTGCCAGCG TGGCGGGGAT CCGTAGTGAT AACATCTTCT TTGAAAACAC AGGCCACATC ACACCGGGTA GCGCGAATGC TTACATTCTG ATGGAAGTGG GCGCAACGCC ACAGCATGTG CTTGACCAAC TCAATAAACA TATCATGCAA GACGGCCACC ACGGCCACGG TGACGTGCTG ACTTGTTTAG CCATCCCAGA GACTCAGCAC AGCATCAGTG CGCAGGTGGT CTTTGTCGCG AATCTCGATG AGATGCAGAA AATCAATGAA CTGCTGGAAG TAGAAAACCG CATTCGTGCC GCATTCCGTG AAACAGCGGC TTATCCAGAA ATGACCAGAG CGAAACCAGA AAGCCGATTC AGCATTTCAC AGCTCGCCCA TGAAATTCAC AGCAAGATGG AGAACGTCGA ATCCGTACTC ATCAAAGTAG ACGGTGAACC AACCGACATC ATCAGCTTGC TCACTCAACC CCGCTTACAA ACCCTCACCG TCACGGAGCT GGAACAATGA
|
Protein sequence | MSKRPQADFV EILSESGVPV TEDAFEAALK ADVTESGSLL SNDSQMSPFW RWVRAAVVTP AVWLIRTLLA GHVMPNIFVG TAERWALELK AWEYNVTPKG AVSTQGLITF TKANAADETS IEAGTIIQTP EIEGKVYKLT AIKTTVIKAG QASGKVLCEA SEAGAAYNLP AGYFSILPQG VSGIVSVTNE ANWITQLGAD QESDEELALR LQNAFTSAGE WHIDDVYRAM IASVAGIRSD NIFFENTGHI TPGSANAYIL MEVGATPQHV LDQLNKHIMQ DGHHGHGDVL TCLAIPETQH SISAQVVFVA NLDEMQKINE LLEVENRIRA AFRETAAYPE MTRAKPESRF SISQLAHEIH SKMENVESVL IKVDGEPTDI ISLLTQPRLQ TLTVTELEQ
|
| |