Gene VC0395_0423 details


Gene Information

Locus tag: VC0395_0423
Symbol:
ID: 5134909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009456
Strand:
Start bp: 468183
End bp: 469640
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 50%
IMG OID: 640530746
Product: N-acetylglucosamine-binding protein A
Protein accession: YP_001215264
Protein GI: 147672074
COG category: [S] Function unknown
COG ID: [COG3397] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
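
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 468183
END_BP = 469640
GENE_LENGTH_BP = 1458
PROTEIN_LENGTH_AA = 485


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons print True when the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 469640 - 468183 + 1 = 1458 bp, and 1458 / 3 - 1 = 485 aa, which agrees with the listed Gene Length and Protein Length.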


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0208095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC AACCTAAAAT GACCGCTATC GCCCTGATCC TCTCTGGTAT CAGTGGATTA 
GCGTATGGAC ACGGCTACGT TTCCGCAGTG GAAAACGGTG TCGCCGAAGG ACGTGTCACC
TTGTGTAAAT TTGCCGCTAA CGGCACTGGA GAAAAAAACA CTCACTGTGG CGCGATTCAA
TACGAACCAC AAAGTGTCGA AGGCCCAGAT GGCTTCCCGG TCACTGGCCC TCGCGATGGC
AAAATTGCCA GTGCGGAATC GGCACTGGCG GCAGCGCTGG ATGAGCAAAC CGCCGACCGT
TGGGTAAAGC GCCCAATTCA AGCTGGCCCG CAAACCTTCG AGTGGACGTT TACCGCCAAC
CACGTCACAA AGGATTGGAA ATACTACATT ACCAAACCAA ACTGGAACCC AAACCAGCCA
TTGTCGCGTG ATGCATTTGA CCTCAATCCG TTCTGTGTCG TTGAAGGAAA TATGGTGCAG
CCACCAAAAC GTGTCAGCCA CGAATGTATC GTGCCTGAGC GCGAAGGGTA TCAGGTCATC
CTCGCCGTAT GGGATGTGGG CGATACCGCA GCTTCCTTCT ACAACGTGAT CGACGTGAAA
TTTGACGGTA ACGGCCCAGT GTTACCGGAT TGGAACCCAG CAGGTCAAAT CATTCCAAGT
ATGGATCTCA GCATTGGCGA TACCGTGTAC ACTCGCGTGT TTGATAACGA GGGGGAAAAC
CCCGCTTATC GCACTGAGCT GAAAATTGAC TCTGAGACGC TAACCAAAGC CAATCAATGG
TCTTACGCTC TGGCGACTAA AATTAACCAA ACGCAAAAAC AGCAACGTGC TGGTCAGCTT
AATGGCGATC AATTTGTTCC CGTTTACGGC ACCAACCCGA TTTATCTGAA AGAAGGCAGT
GGCTTGAAGA GTGTTGAAAT TGGCTACCAA ATTGAAGCGC CACAGCCTGA GTATTCACTG
ACGGTTTCTG GTCTAGCGAA AGAGTATGAG ATTGGCGAAC AACCGATTCA GCTTGACCTG
ACTTTAGAAG CGCAAGGTGA AATGAGCGCA GAGCTGACGG TTTATAACCA CCACCAAAAG
CCGCTGGCAA GTTGGTCACA AGCGATGACG GATGGCGAGC TGAAATCAGT AACCTTAGAA
CTGAGTGAAG CCAAAGCTGG ACACCACATG CTGGTTTCTC GCATCAAAGA TCGCGATGGC
AATCTGCAAG ATCAACAAAC TCTCGATTTC ATGCTGGTTG AACCGCAAAC ACCACCAACA
CCGGGTGACT ACGACTTTGT TTTCCCGAAT GGCCTGAAAG AGTACGTGGC TGGCACCAAA
GTGCTCGCTA GTGATGGCGC AATCTACCAA TGTAAGCCAT GGCCATACTC TGGCTACTGC
CAGCAATGGA CAAGTAACGC TACTCAATAC CAACCGGGTA CCGGCAGTCA TTGGGAAATG
GCGTGGGATA AACGTTAA
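
The gene sequence is listed in blocks of 10 bases separated by spaces, so it needs to be normalized before its length or GC content can be compared with the Gene Length and GC content fields. A minimal sketch of that step follows; the helper names are hypothetical, and only the first row of the listing is pasted in for brevity.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence listing; paste the full listing here
# to compare against the record's Gene Length and GC content fields.
first_row = "ATGAAAAAAC AACCTAAAAT GACCGCTATC GCCCTGATCC TCTCTGGTAT CAGTGGATTA"
seq = clean_sequence(first_row)

print(len(seq))                    # number of bases after normalization
print(round(gc_fraction(seq), 3))  # GC fraction of this fragment only
```

Note that the GC value of a single row is not expected to match the whole-gene figure.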
 
Protein sequence
MKKQPKMTAI ALILSGISGL AYGHGYVSAV ENGVAEGRVT LCKFAANGTG EKNTHCGAIQ 
YEPQSVEGPD GFPVTGPRDG KIASAESALA AALDEQTADR WVKRPIQAGP QTFEWTFTAN
HVTKDWKYYI TKPNWNPNQP LSRDAFDLNP FCVVEGNMVQ PPKRVSHECI VPEREGYQVI
LAVWDVGDTA ASFYNVIDVK FDGNGPVLPD WNPAGQIIPS MDLSIGDTVY TRVFDNEGEN
PAYRTELKID SETLTKANQW SYALATKINQ TQKQQRAGQL NGDQFVPVYG TNPIYLKEGS
GLKSVEIGYQ IEAPQPEYSL TVSGLAKEYE IGEQPIQLDL TLEAQGEMSA ELTVYNHHQK
PLASWSQAMT DGELKSVTLE LSEAKAGHHM LVSRIKDRDG NLQDQQTLDF MLVEPQTPPT
PGDYDFVFPN GLKEYVAGTK VLASDGAIYQ CKPWPYSGYC QQWTSNATQY QPGTGSHWEM
AWDKR
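
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted; this gene begins with ATG, so alternative start codons are not handled here. The function name and the two fragments used are for illustration only.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate dna codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and final row of the gene sequence listing.
print(translate("ATGAAAAAAC AACCTAAAAT GACCGCTATC"))
print(translate("GCGTGGGATA AACGTTAA"))
```

The first call covers the first 10 codons of the gene and the second covers its last 6, including the TAA stop, so the outputs can be compared with the start and end of the protein sequence listing.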