Gene Information |
Locus tag | VC0395_A0421 |
Symbol | |
ID | 5137828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 448004 |
End bp | 449377 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640531879 |
Product | hypothetical protein |
Protein accession | YP_001216376 |
Protein GI | 147673433 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | |
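The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 448004
END_BP = 449377
GENE_LENGTH_BP = 1374
PROTEIN_LENGTH_AA = 457


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1374
print(expected_protein_length(GENE_LENGTH_BP))  # 457
```

Both results match the reported Gene Length (1374 bp) and Protein Length (457 aa).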
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.100113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGGAGG ACACTATGAT TATTCAAGTC AGCCCAGCCG GCAGTATGGA TCTACTGTCA CAACTCGAAG TCGAACGTCT GAAAAAGACC GCCTCGAGTG ATCTCTACCA ACTTTATCGC AATTGCAGCC TTGCCGTGTT GAACTCAGGT AGCCACACGG ATAACTCGAA AGAGTTGCTC GATAAATATA AGAATTTCGA TATCACCGTC ATGCGCCGTG AACGCGGCAT CAAACTTGAG CTGGCGAATC CCCCCGAACA TGCCTTTGTC GATGGACAGA TCATTAAAGG GATCCAAGAA CACCTCTTTT CTGTGCTGCG CGATATTGTT TACGTCAATA TGCATTTAGC CGATAGCCAG CGCCTTAACC TGACTAACGC GACGCACATA ACTAACCTCG TGTTTGGTAT TTTGCGCAAT GCCGGAGCAC TGATTCCCGG AGCGACACCC AATCTCGTTG TGTGCTGGGG TGGACACTCG ATTAATGAAG TTGAGTACCA ATACACTCGT GAAGTCGGCC ATGAGCTCGG TTTACGTGAA CTGAACATCT GTACTGGTTG TGGCCCCGGT GCGATGGAAG GCCCAATGAA AGGTGCGGCG GTCGGCCACG CCAAACAGCG TTATTCGGAA TACCGCTACC TCGGATTGAC CGAACCTTCA ATTATTGCCG CCGAGCCGCC CAACCCTATC GTTAATGAGC TGGTGATCAT GCCAGATATC GAAAAACGTT TGGAAGCGTT CGTGCGTATG GCACATGGCA TCATCATTTT CCCCGGAGGT CCCGGTACGG CGGAAGAGCT ACTGTATATC TTGGGCATTA TGATGCACCC AGAAAACGCC GATCAGCCAA TGCCTATTGT GCTGACAGGG CCAAAACAGA GTGAAGCGTA TTTCCGCTCT TTGGATAAAT TCATTACCGA TACCTTGGGT GAAGCCGCTC GCAAGCATTA CAGCATCGTC ATCGATAATC CAGCGGAAGC GGCACGTATT ATGAGTAATG CCATGCCACT GGTGCGCCAA CATCGTAAAG ATAAAGAAGA TGCGTACAGT TTTAACTGGT CACTCAAAAT TGAACCAGAA TTCCAACTGC CGTTTGAGCC TAACCACGAG AGCATGGCGA ACCTTGATTT GCACTTAAAC CAACGCCCAG AAGTGCTTGC CGCCAACCTG CGTCGAGCCT TCTCTGGTGT TGTGGCAGGC AACGTCAAAG CGGAAGGGAT CCGTGAAATT GAACGCCACG GTCCCTTTGA AATGCATGGT GACCCAGTGT TAATGAAAAA AATGGATCAG CTACTCAATG ATTTCGTTGC CCAAAACCGG ATGAAACTTC CAGGCGGCAG CGCATACGAG CCTTGCTATA AGATAGTGAC TTAA
|
Protein sequence | MKEDTMIIQV SPAGSMDLLS QLEVERLKKT ASSDLYQLYR NCSLAVLNSG SHTDNSKELL DKYKNFDITV MRRERGIKLE LANPPEHAFV DGQIIKGIQE HLFSVLRDIV YVNMHLADSQ RLNLTNATHI TNLVFGILRN AGALIPGATP NLVVCWGGHS INEVEYQYTR EVGHELGLRE LNICTGCGPG AMEGPMKGAA VGHAKQRYSE YRYLGLTEPS IIAAEPPNPI VNELVIMPDI EKRLEAFVRM AHGIIIFPGG PGTAEELLYI LGIMMHPENA DQPMPIVLTG PKQSEAYFRS LDKFITDTLG EAARKHYSIV IDNPAEAARI MSNAMPLVRQ HRKDKEDAYS FNWSLKIEPE FQLPFEPNHE SMANLDLHLN QRPEVLAANL RRAFSGVVAG NVKAEGIREI ERHGPFEMHG DPVLMKKMDQ LLNDFVAQNR MKLPGGSAYE PCYKIVT