Gene Information |
Locus tag | VC0395_A1092 |
Symbol | |
ID | 5137048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1143052 |
End bp | 1144746 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640532550 |
Product | hypothetical protein |
Protein accession | YP_001217038 |
Protein GI | 147675149 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
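The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(1143052, 1144746)
    print(length, protein_length(length))
```

With the values listed for VC0395_A1092, this reproduces the stated gene length of 1695 bp and protein length of 564 aa.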
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.480484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGTA TTACTTTTAA ACGTTCGATT TTAGCCAGCG CAGTATTGCT TGCTACCCAA ACGGCCAACG CGGCACTCTA CCAAGTGATG GAAGTGACCC CAAGCACAGG GCAAAGCTAC GGTAGTGCAT GGGGTGTGGC GATCCAACCA AGTACAGGCA CAGATAGCTG TTTTAATAAT TCGACTGTCG GTAGTGTGAA TTGTCAAAAC TTTGCTCTGG CAGGTGAAAC TCGCATTGAA AAAGCCAGCA CAGGTAAAGC AGTCGATGGT TTAAGTTACC GTGATGAAGT GGCTTTCGGG ATTGATAATG CGTTCGTTTA TGTACAAGAA CGCAATGACT TTGAACGCTA CTGCTACAAC GAACTTTTGT ATTCAACTTG TAATACCTGG GCAGATCCCC ATTGGAATCG CTGGCAAGCA GAGATCAATA GTACGCAAGT TGCTAACTCA ATTGCTTTTA TTGGTACTGG AACAACAGGC GCGCCTATTG ATGAGTCACA AAACGTGATT GTTAATAGCC TAACGAGTAA TGCTACGCCT ATCGGGATCA ATGTAAAAAC TGGTGACGTG ACAACGTATC GTAGAAACGC TAATGCCATT CAAGCCCGTT CCACCGTCGC ACCGAATATC ACTGATGCCT TATATACCCG TGCATGGAAA ACGGATGGTG TTTACACCGT TGGCAGTATT TCACGAAGCT CAAACAATAA CGAAGGTGCC TATTTTTATT CCAAGCCTGC AATTTGGAAA AATTCTAACG GTGAAACAGT AGAGCTTTCT TGGCCAACTG GTACCGAACC TAATCGCAAT AATCGTCTTG CACAAGGCAG TATGCGTGAT GTCGTTGAAA ATGGAGGTAA GCTATACGCG GTCGGTTATA GTTCATACGA TACAGACAAT CACTACATGC AGGCCTCGGT TTTCGAACTA GACAACACCA GTAATTTTTC TAATGCAGCA AGTTGGACGA CCAAAGCCGT TTCCGGCGCT GAATCTAGAA TTGGTGGCGA TTACATTCAC AGTAATTCGG TAGTCACCGA TGTAAACAAA AACCTCGTCG CGTTAGGCTC TGCTAAACGT GCAGGTAGCC GCCCAGAGAA TGGAGCCGCG GGCAATCGAT TATTTGTTAT CGAAGATGTT TCTGCTAGCA CTCCAACAGC GAACTTCTTA ACCGGCGGCA TCTTCTTCAC AGGAGCCGGC GGCAAAGCAG GGGCAATTAA CAGCTATAAC GAGATTGTCG GTCAAGTCGA CGCAAACGAC ACCCGTGAGA ATGATGGTAA ACCGCGCCGT AAAAGAGGGT TTATTTATCC ATACAGTGCC AATGGCAGTG ATCCAAGCCG CATGGCTATT TTTGCTAATA AGGCTTGGTT GTTAGATGAT CTCACTAATG ACAATACCGC AACAGGCAAC AACAACCAGT TCCGCATTAT TGATGCCACA GATATTAATG ATGCTGGCGT TATCTCTGCA ACTGCGCTGA AATGTTCAGG TGGATACGAT ACGACAGCAC ATAATTCATT GTGTAGTAAC CGCGAAGAAA CCGTCGTCGC CGTGAAGTTA GTTCCAATTG TGAATGCAAC AAGTGCTAAC ATTCAACAGC GTTCAACCGA AGAGCAAGCC TCTGAACGTA AAGGGGGTAG CTTCGGATTA GGGCTTTTGA TGGTGCTTGG TGTACTAGGG TTCCGTAGAA AATAG
|
Protein sequence | MSRITFKRSI LASAVLLATQ TANAALYQVM EVTPSTGQSY GSAWGVAIQP STGTDSCFNN STVGSVNCQN FALAGETRIE KASTGKAVDG LSYRDEVAFG IDNAFVYVQE RNDFERYCYN ELLYSTCNTW ADPHWNRWQA EINSTQVANS IAFIGTGTTG APIDESQNVI VNSLTSNATP IGINVKTGDV TTYRRNANAI QARSTVAPNI TDALYTRAWK TDGVYTVGSI SRSSNNNEGA YFYSKPAIWK NSNGETVELS WPTGTEPNRN NRLAQGSMRD VVENGGKLYA VGYSSYDTDN HYMQASVFEL DNTSNFSNAA SWTTKAVSGA ESRIGGDYIH SNSVVTDVNK NLVALGSAKR AGSRPENGAA GNRLFVIEDV SASTPTANFL TGGIFFTGAG GKAGAINSYN EIVGQVDAND TRENDGKPRR KRGFIYPYSA NGSDPSRMAI FANKAWLLDD LTNDNTATGN NNQFRIIDAT DINDAGVISA TALKCSGGYD TTAHNSLCSN REETVVAVKL VPIVNATSAN IQQRSTEEQA SERKGGSFGL GLLMVLGVLG FRRK
|
| |
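The sequence fields above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a gene sequence could be cleaned, checked for GC content, and translated. It assumes the listed sequence is already the coding strand, which is suggested by its ATG start even though the gene lies on the minus strand. The helper names are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares; the alternative start codons that table 11 permits are not handled here.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence shown above.
    print(translate("ATGAGTCGTA TTACTTTTAA"))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two fields agree.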