Gene VC0395_A2088 details

Gene Information

Locus tag: VC0395_A2088
Symbol: hepA
ID: 5135189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2246656
End bp: 2249565
Gene Length: 2910 bp
Protein Length: 969 aa
Translation table: 11
GC content: 52%
IMG OID: 640533544
Product: ATP-dependent helicase HepA
Protein accession: YP_001218004
Protein GI: 147673548
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
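
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 2910 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2246656, 2249565)
print(length, protein_length_aa(length))  # prints: 2910 969
```

Both values agree with the Gene Length and Protein Length fields of this record.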


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000725404
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCATTTG CTTTAGGGCA ACGCTGGATA AGCGATACGG AAAGTGATCT CGGGTTAGGT 
ACTGTGGTGG CGCTTGATGC GCGTACAGTG ACCTTGATGT TTGCAGCATC GGAAGAAAAT
CGTGTGTATG CACGTTCAGA TGCGCCGGTC ACCCGTGTGA TCTTTAATGT CGGCGATGTG
GTGGATAGTC AGCAAGGCTG GTCGTTACAA GTTGAACAAG TGGTGGAAGA CCAAGGCGTC
TACACCTATC TAGGCACCCG AGTGGATACC GAAGAAAGCG GTGTCGCGCT GCGTGAAATT
TTCCTCAGTA ACCAAATTCG TTTTAATAAA CCGCAGGACA AGTTGTTTGC CGGTCAAATC
GATCGCATGG ACAACTTCGT GCTGCGTTAT CGCGCTCTAG CCAATCAATA TCAACAGCAC
AAAAGCCCAA TGCGTGGCTT GTGTGGTATG CGCGCTGGGC TGATCCCGCA TCAGTTGTAC
ATTGCCCATG AAGTCGGTCG CCGCCATGCG CCACGCGTAT TGTTAGCCGA TGAAGTCGGT
TTGGGTAAAA CCATCGAAGC GGGCATGATC ATCCATCAGC AAGTGCTGAC CGGGCGTGCC
GAGCGCATCC TGATTGTGGT GCCTGAAACG CTGCAACACC AATGGTTGGT GGAGATGATG
CGCCGTTTCA ACCTGCACTT TTCGATTTTT GATGAAGAGC GCTGTGTGGA AGCCTTTAGC
GAGGCCGATA ACCCATTTGA AACTCAGCAG TATGTGCTGT GTTCACTCGA TTTCTTGCGT
AAAAGCCGCC AGCGTTTTGA ACAAGCCTTG GAAGCGGAAT GGGATTTATT GGTCGTCGAT
GAAGCTCACC ATCTTGAGTG GCACCCAGAA AAGCCAAGCC GCGAATATCA AGTGATTGAA
GCTCTGGCCG AACAGACTCC GGGCGTACTG CTATTGACGG CAACGCCTGA GCAGTTAGGT
CGTGAAAGCC ACTTTGCTCG TCTGCGCCTG CTGGATGCAG ATCGTTTCTA CGATTACGAA
GCCTTTGTTA AAGAAGAAGA GCAATACGCA CCCGTTGCCG ATGCCGTTAC TGCGCTGTTC
AGCGGTGAGA AGCTGAGTGA TGAAGCGAAA AACAAAATCA CTGAGCTATT GTCTGAGCAA
GATGTGGAGC CGCTGTTTAA AGCGTTGGAA AGCCACGCCA GCGAGGACGA AATTGCTTTG
GCGCGCCAAG AGCTGATCGA CAATCTGATG GATCGCCACG GTACTGGGCG TGTGTTGTTC
CGTAACACGC GTGCGGCAAT CAAAGGCTTC CCAGTGCGTA ATGTGCATTT GCTGCCATTG
GAGATCCCTT CTCAATACAC CACGTCAATG CGTGTTGCGG GCATGCTCGG CGGTAAACTG
ACGCCAGAAG CGCGCGCGAT GAAAATGCTC TACCCGGAAG AGATTTTCCA AGAGTTTGAA
GGTGACGAAT CAAGCTGGTG GCAATTTGAC TCACGCGTTA ACTGGTTGCT CGAAAAAGTC
AAAGCCAAAC GCAGCGAGAA GATCCTCGTC ATCGCTTCAC GTGCCAGCAC CGCGTTGCAG
CTAGAGCAGG CCTTGCGTGA ACGTGAAGGC ATTCGCGCAA CGGTATTTCA TGAAGGCATG
TCGATTATTG AGCGTGACAA AGCCGCTGCT TACTTTGCCC AAGAAGAGGG CGGTGCGCAG
GTGTTGATCT GTAGTGAAAT CGGCTCCGAA GGTCGTAACT TCCAGTTTGC GAACCAATTG
GTGATGTTTG ATCTGCCGTT CAACCCAGAC TTGCTGGAGC AGCGTATCGG GCGTTTGGAC
CGTATCGGCC AAAAGCGTGA TATCGATGTG TACGTGCCGT ATTTGACCGA AACGTCACAA
GCGATTTTGG CGCGTTGGTT TCAAGAAGGT TTGAATGCCT TTGCGGAAAC CTGTCCAACG
GGTCGTGCGG TGTACGATGC CTTCGCTGAG CGTTTAATCC CAATTCTGGC CGCTGGTGGC
GGTGAAGAGC TGGAAGTGAT CATCGAAGAG TCAGCCAAGC TCAACAAAAC ACTGAAATCG
CAGCTGGAAG TCGGGCGTGA TCGCTTGTTG GAAATGCATT CTAACGGTGG CGAAAAAGCG
CAGCAGATTG CCGAGCAGAT CGCGAAAACC GATGGCGATA CCAATCTGGT GACGTTTGCC
TTGAGCCTGT TTGATGCGAT TGGTCTGCAT CAGGAAGATC GTGGCGAGAA TGCTCTAGTG
GTCACTCCTG CCGAACACAT GATGGTGCCA AGCTATCCGG GCCTCCCTTA TGAAGGCGCC
ACCATTACCT TTGATCGTGA CACCGCACTG TCGCGTGAAG ATATGCACTT CATCAGTTGG
GAACACCCCA TGGTGCAAGG TGGCATTGAT TTACTGATGA GTGAAGGGGT GGGCACTTGC
GCGGTGTCGC TGTTGAAAAA CAAAGCACTG CCAGTGGGTA CTATCTTGCT GGAGCTGGTG
TATGTGGTGG ATGCCCAAGC GCCGAAACGC AGTGGCATCA GCCGCTTCTT GCCTGTCTCG
CCAATCCGAA TTCTGATGGA TGCGCGCGGT AATGATCTCT CTTCGCAAGT GGAGTTTGAA
AGCTTTAACC GTCAGCTCAG CCCAGTGAAT CGTCATTTGG CCAGCAAGCT GGTGAGTTCA
GTACAGCATG ACGTGCATCG TTTGATTACG GCAAGTGAAG CGGCGGTTGA ACCGCGAGTG
AGCGCGATTC GTGAACAAGC GCAGCGTGAT ATGCAGCAGA GCCTTAACAG CGAGTTGGAG
CGTTTGCTGG CACTCAAAGC GGTTAACCCG AACATTCGTG ATGAAGAGAT CGAAGTGCTG
GAGCAGCAAA TCAAAGAGTT GACTGGCTAT ATTGCGCAGG CGCAGTATCA GCTGGATTCA
CTGCGTTTGA TTGTGGTGGC ACACAACTGA
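
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The helper name `gc_percent` is hypothetical. No result for the full gene is quoted, because it was not computed here.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)


# First two blocks of the gene sequence: 8 of 20 bases are G or C.
print(gc_percent("ATGTCATTTG CTTTAGGGCA"))  # prints: 40.0
```

Passing the complete gene sequence to the same function gives a value to compare against the rounded figure in the GC content field.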
 
Protein sequence
MSFALGQRWI SDTESDLGLG TVVALDARTV TLMFAASEEN RVYARSDAPV TRVIFNVGDV 
VDSQQGWSLQ VEQVVEDQGV YTYLGTRVDT EESGVALREI FLSNQIRFNK PQDKLFAGQI
DRMDNFVLRY RALANQYQQH KSPMRGLCGM RAGLIPHQLY IAHEVGRRHA PRVLLADEVG
LGKTIEAGMI IHQQVLTGRA ERILIVVPET LQHQWLVEMM RRFNLHFSIF DEERCVEAFS
EADNPFETQQ YVLCSLDFLR KSRQRFEQAL EAEWDLLVVD EAHHLEWHPE KPSREYQVIE
ALAEQTPGVL LLTATPEQLG RESHFARLRL LDADRFYDYE AFVKEEEQYA PVADAVTALF
SGEKLSDEAK NKITELLSEQ DVEPLFKALE SHASEDEIAL ARQELIDNLM DRHGTGRVLF
RNTRAAIKGF PVRNVHLLPL EIPSQYTTSM RVAGMLGGKL TPEARAMKML YPEEIFQEFE
GDESSWWQFD SRVNWLLEKV KAKRSEKILV IASRASTALQ LEQALREREG IRATVFHEGM
SIIERDKAAA YFAQEEGGAQ VLICSEIGSE GRNFQFANQL VMFDLPFNPD LLEQRIGRLD
RIGQKRDIDV YVPYLTETSQ AILARWFQEG LNAFAETCPT GRAVYDAFAE RLIPILAAGG
GEELEVIIEE SAKLNKTLKS QLEVGRDRLL EMHSNGGEKA QQIAEQIAKT DGDTNLVTFA
LSLFDAIGLH QEDRGENALV VTPAEHMMVP SYPGLPYEGA TITFDRDTAL SREDMHFISW
EHPMVQGGID LLMSEGVGTC AVSLLKNKAL PVGTILLELV YVVDAQAPKR SGISRFLPVS
PIRILMDARG NDLSSQVEFE SFNRQLSPVN RHLASKLVSS VQHDVHRLIT ASEAAVEPRV
SAIREQAQRD MQQSLNSELE RLLALKAVNP NIRDEEIEVL EQQIKELTGY IAQAQYQLDS
LRLIVVAHN