Gene VC0395_A1819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1819 
SymbolpurM 
ID5136261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1933885 
End bp1934925 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content50% 
IMG OID640533276 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001217743 
Protein GI147675067 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGGTA ATAATCCATC TCTCAGCTAC AAAGATGCAG GTGTCGATAT TGATGCAGGT 
AATGCACTCG TTGAACGAAT TAAAGGCGCC GTGAAGCGCA CCCGTCGCCC TGAAGTTATG
GGTGGCCTAG GTGGTTTTGG CGCACTGTGT GAACTGCCAA CCAAATACAA GCACCCTGTT
TTAGTCTCTG GCACTGACGG CGTAGGTACC AAACTGCGTC TCGCTCTGGA TATGAAAAAA
CACGACACCA TAGGTATCGA TCTGGTGGCG ATGTGCGTCA ATGATCTGAT TGTTCAAGGC
GCTGAGCCGC TGTTTTTCCT CGATTATTAC GCGACAGGCA AACTGGATGT GGATACGGCT
GCTGAAGTGA TTTCTGGTAT TGCCGATGGC TGTTTGCAAG CGGGCTGCGC GCTGATTGGC
GGCGAAACCG CGGAAATGCC AGGCATGTAC GAAGGTGAAG ACTACGACGT GGCAGGTTTT
TGTGTCGGTG TCGTCGAAAA AGAAGAGATC ATCGACGGCA GTAAGGTACA AGTGGGTGAT
GCGCTGATTG CGGTTGGCTC AAGCGGCCCA CACTCCAACG GTTACTCGCT GGTACGTAAG
ATTTTAGAAG TCTCTAAAGC CGATAAGAAT GAGCGGTTAG CAGGCAAAAC CATTGGTGAG
CACTTACTCG CACCGACCAA AATTTATATC AAATCTGGCT TAAAGCTGAT TGCTGAACAT
GACATTCATG CGATTTCACA CATCACTGGC GGTGGCTTCT GGGAAAACAT TCCACGCGTA
TTGCCAGAAG GTACAAAAGC CGTGATCGAT GGTAAGAGCT GGGAATGGCC AGTGATTTTC
CAATGGTTAC AGGAAAAAGG TAACGTGACC ACTCACGAAA TGTACCGCAC CTTCAACTGT
GGTGTCGGTT TGATCATTGC ACTGCCAAAA GATCAAGCCA ATGCGGCCGT TGCGCTACTG
CAAGCAGAAG GCGAAACCGC ATGGGTCATC GGCGAAATCG CAGCCGCCAA TAGCAACGAA
GCACAGGTAG AGATCAACTA A
 
Protein sequence
MSGNNPSLSY KDAGVDIDAG NALVERIKGA VKRTRRPEVM GGLGGFGALC ELPTKYKHPV 
LVSGTDGVGT KLRLALDMKK HDTIGIDLVA MCVNDLIVQG AEPLFFLDYY ATGKLDVDTA
AEVISGIADG CLQAGCALIG GETAEMPGMY EGEDYDVAGF CVGVVEKEEI IDGSKVQVGD
ALIAVGSSGP HSNGYSLVRK ILEVSKADKN ERLAGKTIGE HLLAPTKIYI KSGLKLIAEH
DIHAISHITG GGFWENIPRV LPEGTKAVID GKSWEWPVIF QWLQEKGNVT THEMYRTFNC
GVGLIIALPK DQANAAVALL QAEGETAWVI GEIAAANSNE AQVEIN