Gene VC0395_1039 details


Gene Information

Locus tag: VC0395_1039
Symbol:
ID: 5134093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009456
Strand:
Start bp: 1018862
End bp: 1020061
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 53%
IMG OID: 640531361
Product: hypothetical protein
Protein accession: YP_001215875
Protein GI: 147671782
COG category: [S] Function unknown
COG ID: [COG3299] Uncharacterized homolog of phage Mu protein gp47
TIGRFAM ID:
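
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 1200 bp) and that the CDS ends in a single stop codon. The function names `gene_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1018862, 1020061)
print(length)                           # 1200
print(expected_protein_length(length))  # 399
```

Both values agree with the Gene Length and Protein Length fields in the record.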


Plasmid Coverage information

Num covering plasmid clones: 55
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAAAA GACCGCAGGC CGACTTTGTC GAAATACTCT CAGAATCGGG CGTGCCAGTT 
ACTGAGGATG CCTTCGAGGC CGCGCTCAAA GCAGACGTGA CAGAGTCAGG AAGCCTCTTG
TCCAACGATT CGCAAATGTC ACCCTTCTGG CGTTGGGTTC GTGCTGCTGT TGTGACACCT
GCCGTGTGGC TGATCCGCAC ACTGCTCGCA GGGCATGTCA TGCCCAATAT CTTTGTGGGT
ACGGCGGAAC GTTGGGCGCT AGAGCTAAAA GCATGGGAAT ACAACGTCAC GCCCAAAGGC
GCAGTGAGCA CCCAAGGCTT AATCACCTTC ACCAAAGCCA ACGCCGCAGA TGAAACCAGT
ATCGAAGCAG GAACCATCAT TCAAACGCCA GAGATTGAAG GCAAGGTGTA CAAACTTACC
GCAATAAAAA CCACGGTGAT CAAAGCGGGG CAAGCCTCCG GCAAAGTCTT GTGTGAAGCC
AGTGAAGCGG GAGCCGCTTA CAACCTGCCC GCCGGCTATT TCAGCATTCT GCCGCAGGGC
GTATCGGGCA TTGTCTCTGT CACCAATGAA GCGAATTGGA TAACCCAACT CGGCGCAGAC
CAAGAAAGCG ACGAAGAATT AGCCCTACGC CTACAAAACG CCTTTACCAG TGCGGGCGAA
TGGCACATTG ACGATGTTTA CCGCGCCATG ATTGCCAGCG TGGCGGGGAT CCGTAGTGAT
AACATCTTCT TTGAAAACAC AGGCCACATC ACACCGGGTA GCGCGAATGC TTACATTCTG
ATGGAAGTGG GCGCAACGCC ACAGCATGTG CTTGACCAAC TCAATAAACA TATCATGCAA
GACGGCCACC ACGGCCACGG TGACGTGCTG ACTTGTTTAG CCATCCCAGA GACTCAGCAC
AGCATCAGTG CGCAGGTGGT CTTTGTCGCG AATCTCGATG AGATGCAGAA AATCAATGAA
CTGCTGGAAG TAGAAAACCG CATTCGTGCC GCATTCCGTG AAACAGCGGC TTATCCAGAA
ATGACCAGAG CGAAACCAGA AAGCCGATTC AGCATTTCAC AGCTCGCCCA TGAAATTCAC
AGCAAGATGG AGAACGTCGA ATCCGTACTC ATCAAAGTAG ACGGTGAACC AACCGACATC
ATCAGCTTGC TCACTCAACC CCGCTTACAA ACCCTCACCG TCACGGAGCT GGAACAATGA
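
A GC content figure like the one in the record can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the fraction of G and C among unambiguous bases; the helper name `gc_fraction` is hypothetical, and the sketch ignores whitespace and any ambiguity codes.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A, C, G, T characters; other characters are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First 30 nt of the gene sequence above, with the block spacing left in place.
prefix = "ATGAGCAAAA GACCGCAGGC CGACTTTGTC"
print(f"{gc_fraction(prefix):.1%}")
```

Passing the full 1200 nt sequence to the same function gives the gene-wide value for comparison with the GC content field.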
 
Protein sequence
MSKRPQADFV EILSESGVPV TEDAFEAALK ADVTESGSLL SNDSQMSPFW RWVRAAVVTP 
AVWLIRTLLA GHVMPNIFVG TAERWALELK AWEYNVTPKG AVSTQGLITF TKANAADETS
IEAGTIIQTP EIEGKVYKLT AIKTTVIKAG QASGKVLCEA SEAGAAYNLP AGYFSILPQG
VSGIVSVTNE ANWITQLGAD QESDEELALR LQNAFTSAGE WHIDDVYRAM IASVAGIRSD
NIFFENTGHI TPGSANAYIL MEVGATPQHV LDQLNKHIMQ DGHHGHGDVL TCLAIPETQH
SISAQVVFVA NLDEMQKINE LLEVENRIRA AFRETAAYPE MTRAKPESRF SISQLAHEIH
SKMENVESVL IKVDGEPTDI ISLLTQPRLQ TLTVTELEQ
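
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record: it uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs mainly in which start codons are permitted), so an alternative start codon such as GTG would not be rendered as M here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Compact codon table: codons enumerated over T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the spaces and newlines between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence above.
print(translate("ATGAGCAAAA GACCGCAGGC CGACTTTGTC"))  # MSKRPQADFV
```

The output matches the first ten residues of the protein sequence listed above.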