Gene VC0395_A0514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0514 
SymbolnagC 
ID5135116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp549238 
End bp550452 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content47% 
IMG OID640531972 
ProductN-acetylglucosamine repressor 
Protein accessionYP_001216465 
Protein GI147673497 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGCG GACAGATTGG TAACGTAGAT TTAGTTAAAC AGCTAAACAG TGCGGCGGTT 
TACCGCTTGA TTGACCAACA AGGCCCAATC AGTCGGATTC AAGTCGCTGA TGTAAGCCAA
CTGGCTCCGG CCAGTGTTAC CAAGATTACC CGCCAATTGC TTGAACGTGG ATTAATCAAA
GAAGTCGCGC AACAAGCTTC TACAGGTGGT CGGCGAGCCA TTTCGCTGAC CACAGAAGTC
AAGCCGTTCC ACTCTATTGC CGTGCGTATC GGACGTGACT ATATTCAACT CAGCCTCTAT
GACCTAGGTG GAAACTCACT GGTTGATGAG CATCATGAAT TTCACTACAA CACGCAAGAC
GTGTTGATGT CGAGCCTGAT TAAACAGATC AAACAATTTA TTCAACAACA CACCGCATTA
ATTGATCAGT TGATTGCGAT TGGTGTGGCC TTACCCGGTT TGGTGAACCC AGAAACAGGT
GTTGTTGAGT ACATGCCTAA TGTCGCGATC AATGAATTAC CTCTCGGCGC AACCATTCGT
GATGAGTTCC ATGTTGAATG TTTTGTGGGT AACGACGTTC GTGGTATTGC CTTAGCGGAG
CACTATTTCG GAGCAAGCCA AGATTGCCAA GACTCCATAC TAGTCAGTGT ACACCGTGGC
ACAGGTGCAG GGATTATCGT TAACGGTCAA GTCTTCTTAG GCTATAACCG CAACGTCGGT
GAAATTGGCC ACATCCAAAT CGACCCTCTT GGTGAGCAAT GTCAATGTGG CAACTTTGGT
TGTCTAGAAA CCGTTGCGAC CAACCCTGCT ATCACTTCTC GTGTGCAAAA ACTGATCGCT
CAGGGGTATG AATCTTCGCT CTCTACTCTC GATACGATTA CGATTGATGA TGTCTGTGAG
CACGCAAATG CTGGGGATGA ACTGGCGAAA CAAGCGTTGG TTCGTGTAGG TAATCAGTTG
GGTAAGGCGA TTGCGATTAC CGTAAACCTA TTCAACCCAC AAAAGATCGT GATTGCTGGG
CAAATCACCG CCGCCAAAGA GATCGTGTTC CCCGCGATTC AACGCAACGT AGAAAATCAG
TCACTGAAAA CGTTCCACCA ACATCTGCCG ATTGTGTCTT CACAAGTCTA CAAACAACCC
ACTATGGGCG CTTTTGCTAT GATCAAACGT GCGATGTTAA ATGGGGTTCT GTTACAAAAA
TTGCTTGAAG ACTGA
 
Protein sequence
MNGGQIGNVD LVKQLNSAAV YRLIDQQGPI SRIQVADVSQ LAPASVTKIT RQLLERGLIK 
EVAQQASTGG RRAISLTTEV KPFHSIAVRI GRDYIQLSLY DLGGNSLVDE HHEFHYNTQD
VLMSSLIKQI KQFIQQHTAL IDQLIAIGVA LPGLVNPETG VVEYMPNVAI NELPLGATIR
DEFHVECFVG NDVRGIALAE HYFGASQDCQ DSILVSVHRG TGAGIIVNGQ VFLGYNRNVG
EIGHIQIDPL GEQCQCGNFG CLETVATNPA ITSRVQKLIA QGYESSLSTL DTITIDDVCE
HANAGDELAK QALVRVGNQL GKAIAITVNL FNPQKIVIAG QITAAKEIVF PAIQRNVENQ
SLKTFHQHLP IVSSQVYKQP TMGAFAMIKR AMLNGVLLQK LLED