Gene Information |
Locus tag | VC0395_1055 |
Symbol | |
ID | 5134605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | - |
Start bp | 1028210 |
End bp | 1029232 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640531377 |
Product | P2 family phage major capsid protein |
Protein accession | YP_001215891 |
Protein GI | 147672444 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01551] phage major capsid protein, P2 family |
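
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the Gene Length counts the stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1028210
END_BP = 1029232


def gene_length(start: int, end: int) -> int:
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1023 340
```

Both results agree with the Gene Length (1023 bp) and Protein Length (340 aa) fields. The Strand field does not enter the calculation, since an interval has the same length on either strand.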
| |
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCAGA TTCTTACTCA ATCCGCCCGT GAATACATGG ATAACTTCGC TCAGCAATTG GCGAAAAGCT ACGGCGTATC GAACGTAGCA GAACTATTCA ATGTATCACC GCAGTTGGAA ACCAAACTTC GCGCAGCCAT TACCGAGTCT GCCGAGTTTC TGAAAATTAT CACCGTGACC ACCGTTGACC AAATCGAAGG TCAAGTGGTC GATGTGGGTG TGTCCGGTCT TTACACTGGC CGTAAAGCGG GTGGCCGTTT CACCAAGCAA GTGGGCGTAG GTGGTCACAA ATACAAACTC GCGGAAACCG ATTCTTGTGC CGCCATCACT TGGGCAATGC TATGCCAGTG GGCGAACCAA GGTGGCCGCG ATCAGTTCAT GAAGCACCTG ACTGAATTCT CTAACCAAAT GTTCGCACTC GACATCATGC GTGTGGGCTG GAATGGCGTT ACCGCAGCAG AAACGACCAA TCCAGCAGAG TATCCGCTCG GCCAAGATGT CAACGAAGGC TGGATTGCGT ATGTGAAAAA CCGCAAAGCC TCACAAGTGG TGGATGTCGA TGTCTACTTC GATGAAACCA ACGGCGACTA CCGCACGCTA GACGCGATGG CCTCAGACAT CATCAACAAC CAAATTCACC CCATGTTCCG CAACGACCCA CGCTTAACCG TGTTCGTCGG TTCAGGGCTG ATAGGTGCCG CTCAAGCCAA ACTGTACGAC AAAGCGGACA AACCTAGCGA ACAAATCGCC GCTCAAAAGC TCGATAAAAC CATCGCAGGC CGTCCCGCTT ACGTGCCGCC ATTCCTGCCA GACAACGCCA TGGTCGTGAC CATTCCGGCA AACCTGCAGG TATTGACTCA GCACGGCACA GTGCAACGTA AAGCGAAACA CGAATCTGAT CGCAAGCAGT TCGAAAACGC ATACTGGCGC ATGGAAGGTT ACGCAGTGGG TGTGTTGGAA GCGTTCGCCG CTTACAACCC AGAAAAAGTC CACATCGGCC CTAAACCAGA GGCTGGAGCG TAA
|
Protein sequence | MSQILTQSAR EYMDNFAQQL AKSYGVSNVA ELFNVSPQLE TKLRAAITES AEFLKIITVT TVDQIEGQVV DVGVSGLYTG RKAGGRFTKQ VGVGGHKYKL AETDSCAAIT WAMLCQWANQ GGRDQFMKHL TEFSNQMFAL DIMRVGWNGV TAAETTNPAE YPLGQDVNEG WIAYVKNRKA SQVVDVDVYF DETNGDYRTL DAMASDIINN QIHPMFRNDP RLTVFVGSGL IGAAQAKLYD KADKPSEQIA AQKLDKTIAG RPAYVPPFLP DNAMVVTIPA NLQVLTQHGT VQRKAKHESD RKQFENAYWR MEGYAVGVLE AFAAYNPEKV HIGPKPEAGA
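
As a rough illustration of how the two sequence fields relate, the sketch below translates a nucleotide string codon by codon and computes a GC percentage. It assumes the gene sequence is listed in coding orientation (it starts with ATG and ends with TAA, although the gene lies on the minus strand) and that the space-separated 10-character blocks are display formatting only. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so a standard table is used here. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCGCAGA TTCTTACTCA ATCCGCCCGT"
print(translate(first_blocks))  # MSQILTQSAR
```

The ten residues produced match the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.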