Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_1098 |
Symbol | |
ID | 5135032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | - |
Start bp | 1063150 |
End bp | 1064358 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640531420 |
Product | NupC family protein |
Protein accession | YP_001215934 |
Protein GI | 147672143 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATTT TGTTTGGAAT CATCGGTGTT ACGGTACTGA TCTTATGCGC GTATCTGCTC TCTGAAAGCC GCAGTGCGAT TAATTGGAAA ACCATTTCCC GAGCCTTGTT GTTGCAAATT GGTTTTGCGG CTCTTGTGCT TTATTTCCCA TTGGGGCAAA CCGCGCTAAG CAGCTTGAGT AATGGGGTTT CTGGTTTGCT CGGTTTTGCC GATGTCGGCA TTCGCTTTCT GTTTGGTGAT CTTGCCGATA CGGGCTTTAT TTTTGCTGTT CGTGTATTAC CTATCATCAT CTTCTTCAGT GCGCTGATTT CTGCCCTTTA TTACCTCGGT GTGATGCAAA AAGTGATCGC CCTGATCGGC GGTGGCATTC AACGCTTCTT AGGCACCAGT AAGGCGGAAT CACTGGTCGC GACAGGCAAT ATTTTCCTAT CACAAGGCGA ATCGCCACTT TTGGTGCGCC CCTTCCTTGC CAATATGACA CGCTCCGAAC TGTTTGCGGT CATGGCGGGC GGTATGGCAT CGATAGCAGG CTCAGTGCTG GGGGGTTACG CAGGTTTAGG GGTTGAGCTG AAATACTTGA TTGCAGCGAG TTTCATGGCG GCGCCGGGCA GTTTAATGAT GGCGAAAATC ATTGTTCCTG AGCGTGGTGT GCCAATCGAT CAAAGCCAAG TCGAGTTGGA TAAAGCGCAA GACAGCAACT TGATTGATGC TCTCGCTAGC GGTGCAATGA ATGGTATGAA AGTCGCCGTT GCAGTGGGCA CTATGTTGAT TGCGTTCGTC AGCGTGATCG CTATGGTCAA CACTGGCCTT GAAAATCTGG GCGATCTGGT TGGGTTTAGC GGCATTACCT TACAAGCCAT GTTCGGTTAT CTGTTTGCTC CACTGGCATG GGTGATTGGC ATTCCAAGTC ACGAAGTGCT GGCGGCAGGT TCCTACATCG GTCAGAAAGT GGTGATGAAC GAATTTGTCG CTTTCATTGA CTTTGTTGAG CATAAAGCGC TGCTTTCTGA GCATAGCCAA GTCATCATCA CGTTTGCATT GTGTGGCTTT GCCAACATTG GCTCTATCGC GATCCAATTG GGCTCCATTG GCGTGATAGC CCCTGAGCGC CGCTCGGAAG TGGCGAACCT AGGCATTAAA GCGGTCATTG CTGGCACTTT AGCCAACCTA ATGAGCGCTT GCTTAGCGGG GATTTTCATC TCGCTATAA
|
Protein sequence | MAILFGIIGV TVLILCAYLL SESRSAINWK TISRALLLQI GFAALVLYFP LGQTALSSLS NGVSGLLGFA DVGIRFLFGD LADTGFIFAV RVLPIIIFFS ALISALYYLG VMQKVIALIG GGIQRFLGTS KAESLVATGN IFLSQGESPL LVRPFLANMT RSELFAVMAG GMASIAGSVL GGYAGLGVEL KYLIAASFMA APGSLMMAKI IVPERGVPID QSQVELDKAQ DSNLIDALAS GAMNGMKVAV AVGTMLIAFV SVIAMVNTGL ENLGDLVGFS GITLQAMFGY LFAPLAWVIG IPSHEVLAAG SYIGQKVVMN EFVAFIDFVE HKALLSEHSQ VIITFALCGF ANIGSIAIQL GSIGVIAPER RSEVANLGIK AVIAGTLANL MSACLAGIFI SL
|
| |