Gene VC0395_A1931 details

Gene Information

Locus tag: VC0395_A1931
Symbol:
ID: 5135338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2061684
End bp: 2062940
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 49%
IMG OID: 640533388
Product: NupC family protein
Protein accession: YP_001217855
Protein GI: 147673370
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
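
The coordinate and length fields above agree with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; neither convention is stated in this record. The function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2061684, 2062940)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

Both values match the Gene Length and Protein Length fields listed above.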


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000124283
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCTGT TTATGAGCCT CATCGGCATG GCAGTTCTGC TAGGAATCGC AGTTCTACTG 
TCAAGTAACC GTAAAGCTAT CAATCTAAGA ACTGTGGGTG GCGCTTTTGC TATCCAATTT
TCACTGGGTG CATTTATTCT GTATGTGCCT TGGGGCCAAG AGCTACTTCG TGGCTTTTCG
GATGCCGTAT CGAATGTTAT TAACTACGGT AACGATGGTA CTTCATTCCT CTTCGGTGGA
CTGGTATCAG GCAAAATGTT TGAAGTGTTT GGCGGCGGCG GTTTCATTTT CGCATTCCGC
GTACTACCAA CACTGATCTT CTTCTCAGCA CTGATTTCTG TACTGTACTA CTTGGGTGTT
ATGCAATGGG TTATCCGCAT TCTTGGCGGT GGTCTGCAAA AAGCACTGGG TACATCACGC
GCGGAATCTA TGTCTGCGGC TGCAAACATT TTCGTGGGTC AAACTGAAGC ACCATTAGTT
GTTCGTCCAT TCGTTCCAAA AATGACTCAA TCTGAGCTGT TTGCGGTAAT GTGTGGTGGC
TTGGCTTCTA TCGCAGGTGG TGTACTTGCG GGTTACGCTT CAATGGGCGT TAAGATCGAA
TACTTGGTAG CGGCGTCATT CATGGCGGCA CCGGGTGGTC TGCTGTTCGC AAAACTGATG
ATGCCTGAAA CTGAAAAACC ACAAGACAAT GAAGACATTA CTCTTGATGG TGGTGACGAC
AAACCGGCTA ACGTTATCGA TGCGGCTGCT GGCGGTGCTT CTGCTGGTCT GCAACTTGCT
CTGAACGTTG GTGCAATGTT GATTGCCTTT ATCGGTTTGA TTGCTCTGAT CAACGGTATG
TTGGGTGGCA TCGGTGGTTG GTTCGGTATG CCTGAACTGA AACTGGAAAT GCTACTGGGC
TGGTTGTTTG CGCCTCTGGC TTTCCTGATC GGTGTTCCTT GGAACGAAGC AACTGTTGCG
GGTGAGTTCA TCGGTCTAAA AACCGTTGCT AACGAATTCG TTGCTTACTC TCAGTTTGCG
CCTTACCTGA CTGAAGCGGC ACCAGTGGTT CTGTCTGAGA AAACCAAAGC GATCATCTCT
TTCGCTCTGT GTGGTTTTGC GAACCTTTCT TCTATCGCAA TTCTGCTTGG TGGTTTGGGT
AGCTTGGCAC CTAAGCGTCG TGGCGACATC GCTCGTATGG GGGTCAAAGC GGTTATCGCA
GGTACTCTAT CTAACCTGAT GGCAGCGACC ATCGCTGGCT TCTTCCTCTC TTTCTAA
 
Protein sequence
MSLFMSLIGM AVLLGIAVLL SSNRKAINLR TVGGAFAIQF SLGAFILYVP WGQELLRGFS 
DAVSNVINYG NDGTSFLFGG LVSGKMFEVF GGGGFIFAFR VLPTLIFFSA LISVLYYLGV
MQWVIRILGG GLQKALGTSR AESMSAAANI FVGQTEAPLV VRPFVPKMTQ SELFAVMCGG
LASIAGGVLA GYASMGVKIE YLVAASFMAA PGGLLFAKLM MPETEKPQDN EDITLDGGDD
KPANVIDAAA GGASAGLQLA LNVGAMLIAF IGLIALINGM LGGIGGWFGM PELKLEMLLG
WLFAPLAFLI GVPWNEATVA GEFIGLKTVA NEFVAYSQFA PYLTEAAPVV LSEKTKAIIS
FALCGFANLS SIAILLGGLG SLAPKRRGDI ARMGVKAVIA GTLSNLMAAT IAGFFLSF
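
The sequences above are displayed in blocks of ten characters, so spaces and line breaks have to be removed before any computation. The following is a minimal sketch of that clean-up, together with a GC percentage calculation like the one summarized in the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, copied as displayed.
first_line = "ATGAGCCTGT TTATGAGCCT CATCGGCATG GCAGTTCTGC TAGGAATCGC AGTTCTACTG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_content(seq), 1))  # GC percentage of this 60-base fragment
```

Pasting the full gene sequence in place of first_line should give a length of 1257 and a GC percentage close to the 49% reported above, assuming the record rounds to the nearest whole percent.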