Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1931 |
Symbol | |
ID | 5135338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2061684 |
End bp | 2062940 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640533388 |
Product | NupC family protein |
Protein accession | YP_001217855 |
Protein GI | 147673370 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000124283 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGT TTATGAGCCT CATCGGCATG GCAGTTCTGC TAGGAATCGC AGTTCTACTG TCAAGTAACC GTAAAGCTAT CAATCTAAGA ACTGTGGGTG GCGCTTTTGC TATCCAATTT TCACTGGGTG CATTTATTCT GTATGTGCCT TGGGGCCAAG AGCTACTTCG TGGCTTTTCG GATGCCGTAT CGAATGTTAT TAACTACGGT AACGATGGTA CTTCATTCCT CTTCGGTGGA CTGGTATCAG GCAAAATGTT TGAAGTGTTT GGCGGCGGCG GTTTCATTTT CGCATTCCGC GTACTACCAA CACTGATCTT CTTCTCAGCA CTGATTTCTG TACTGTACTA CTTGGGTGTT ATGCAATGGG TTATCCGCAT TCTTGGCGGT GGTCTGCAAA AAGCACTGGG TACATCACGC GCGGAATCTA TGTCTGCGGC TGCAAACATT TTCGTGGGTC AAACTGAAGC ACCATTAGTT GTTCGTCCAT TCGTTCCAAA AATGACTCAA TCTGAGCTGT TTGCGGTAAT GTGTGGTGGC TTGGCTTCTA TCGCAGGTGG TGTACTTGCG GGTTACGCTT CAATGGGCGT TAAGATCGAA TACTTGGTAG CGGCGTCATT CATGGCGGCA CCGGGTGGTC TGCTGTTCGC AAAACTGATG ATGCCTGAAA CTGAAAAACC ACAAGACAAT GAAGACATTA CTCTTGATGG TGGTGACGAC AAACCGGCTA ACGTTATCGA TGCGGCTGCT GGCGGTGCTT CTGCTGGTCT GCAACTTGCT CTGAACGTTG GTGCAATGTT GATTGCCTTT ATCGGTTTGA TTGCTCTGAT CAACGGTATG TTGGGTGGCA TCGGTGGTTG GTTCGGTATG CCTGAACTGA AACTGGAAAT GCTACTGGGC TGGTTGTTTG CGCCTCTGGC TTTCCTGATC GGTGTTCCTT GGAACGAAGC AACTGTTGCG GGTGAGTTCA TCGGTCTAAA AACCGTTGCT AACGAATTCG TTGCTTACTC TCAGTTTGCG CCTTACCTGA CTGAAGCGGC ACCAGTGGTT CTGTCTGAGA AAACCAAAGC GATCATCTCT TTCGCTCTGT GTGGTTTTGC GAACCTTTCT TCTATCGCAA TTCTGCTTGG TGGTTTGGGT AGCTTGGCAC CTAAGCGTCG TGGCGACATC GCTCGTATGG GGGTCAAAGC GGTTATCGCA GGTACTCTAT CTAACCTGAT GGCAGCGACC ATCGCTGGCT TCTTCCTCTC TTTCTAA
|
Protein sequence | MSLFMSLIGM AVLLGIAVLL SSNRKAINLR TVGGAFAIQF SLGAFILYVP WGQELLRGFS DAVSNVINYG NDGTSFLFGG LVSGKMFEVF GGGGFIFAFR VLPTLIFFSA LISVLYYLGV MQWVIRILGG GLQKALGTSR AESMSAAANI FVGQTEAPLV VRPFVPKMTQ SELFAVMCGG LASIAGGVLA GYASMGVKIE YLVAASFMAA PGGLLFAKLM MPETEKPQDN EDITLDGGDD KPANVIDAAA GGASAGLQLA LNVGAMLIAF IGLIALINGM LGGIGGWFGM PELKLEMLLG WLFAPLAFLI GVPWNEATVA GEFIGLKTVA NEFVAYSQFA PYLTEAAPVV LSEKTKAIIS FALCGFANLS SIAILLGGLG SLAPKRRGDI ARMGVKAVIA GTLSNLMAAT IAGFFLSF
|
| |