Gene VC0395_A0003 details

Gene Information

Locus tag: VC0395_A0003
Symbol: mutY
ID: 5135715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1199
End bp: 2260
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 51%
IMG OID: 640531463
Product: A/G-specific adenine glycosylase
Protein accession: YP_001215977
Protein GI: 147675529
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
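
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and do not come from the record.

```python
# Values copied from the record; variable names are hypothetical.
start_bp = 1199
end_bp = 2260

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1062 bp
print(protein_length, "aa")  # 353 aa
```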


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000019513
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTCCTT TTGCACAGGC CATCCTTACT TGGTATGACG CCTACGGCCG CAAAAACCTG 
CCGTGGCAAC AAAATAAAAA TGCGTATCGC GTTTGGTTAT CGGAAATCAT GTTACAGCAG
ACTCAAGTCG CGACCGTGAT CCCCTACTTT GAACGCTTTT TAGAGCGCTT CCCGACCGTA
CACGCCCTCG CGGCAGCGCC GCAAGATGAA GTGCTGCATT TCTGGACGGG GCTTGGCTAC
TACGCCAGAG CGCGCAATCT GCATAAAGCA GCGAAAATGG TTGTGAGTGA ATATAGCGGC
GAATTTCCCA CCGATTTAGA GCAGATGAAT GCGCTACCCG GTGTTGGCCG TTCCACCGCG
GCAGCCGTGC TCTCTTCTGT GTATAAAAAA CCACACGCCA TTTTGGATGG CAACGTGAAA
CGCACGTTAG CGCGCTGCTT TGCCGTTGAA GGTTGGCCGG GGCAAAAAAG TGTCGAAAAC
CAGCTTTGGC ATTATGCAGA AATGCACACG CCCAAAGTGG ATGTTGATAA ATACAACCAA
GCCATGATGG ATATGGGCGC AATGATCTGC ATTCGCAGTA AGCCCAAATG CAGCCTGTGC
CCAGTAGAAT CGTTTTGCCT TGCCAAGCAG CAAGGCAATC CCCAAGAGTA TCCGGGCAAG
AAACCGAAAA CAGATAAACC CGTCAAAGCC ACTTGGTTTG TCATGCTCTA TCACGACAAT
GCCGTCTGGC TTGAGCAGCG CCCGCAAAGC GGAATTTGGG GCGGTTTGTA CTGCTTCCCG
CAATCAGAGA TCGCCAATAT TCAAACCACC ATAGATCAGC GCGCCATCGG CGATAGCACA
ATAACATCGC AGAAAACCCT GATCGCATTT CGCCACACCT TTAGCCACTA CCATCTCGAT
ATTACGCCGA TTTTGCTGCA ATTAAGCCGC AAACCGGACA TCGTCATGGA AGGGAGCAAA
GGTCTTTGGT ATAACTTAAG TCAACCCGAT GAGATTGGTC TCGCGGCACC AGTGAAACAA
CTGTTGCACA GCTTACCTTT CGACATTGAT AGCCACATTT AA
 
Protein sequence
MTPFAQAILT WYDAYGRKNL PWQQNKNAYR VWLSEIMLQQ TQVATVIPYF ERFLERFPTV 
HALAAAPQDE VLHFWTGLGY YARARNLHKA AKMVVSEYSG EFPTDLEQMN ALPGVGRSTA
AAVLSSVYKK PHAILDGNVK RTLARCFAVE GWPGQKSVEN QLWHYAEMHT PKVDVDKYNQ
AMMDMGAMIC IRSKPKCSLC PVESFCLAKQ QGNPQEYPGK KPKTDKPVKA TWFVMLYHDN
AVWLEQRPQS GIWGGLYCFP QSEIANIQTT IDQRAIGDST ITSQKTLIAF RHTFSHYHLD
ITPILLQLSR KPDIVMEGSK GLWYNLSQPD EIGLAAPVKQ LLHSLPFDID SHI
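
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below assumes the standard amino acid assignments (which translation table 11 shares) and treats ATG, GTG, and TTG in the first position as start codons that encode methionine; that start set is only a common subset of the table 11 initiators. The names `translate`, `gc_percent`, and `clean` are hypothetical helpers, not part of the record.

```python
# Hypothetical helpers for checking the record; not part of the source page.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon table in TCAG order (standard amino acid assignments).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a common subset of the start codons used with table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is still read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as displayed in the record.
first_block = "GTGACTCCTT TTGCACAGGC CATCCTTACT"
print(translate(first_block))  # MTPFAQAILT
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces) gives a quick consistency check, and `gc_percent` can be compared with the GC content field.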