Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2088 |
Symbol | hepA |
ID | 5135189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2246656 |
End bp | 2249565 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640533544 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_001218004 |
Protein GI | 147673548 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000725404 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTTG CTTTAGGGCA ACGCTGGATA AGCGATACGG AAAGTGATCT CGGGTTAGGT ACTGTGGTGG CGCTTGATGC GCGTACAGTG ACCTTGATGT TTGCAGCATC GGAAGAAAAT CGTGTGTATG CACGTTCAGA TGCGCCGGTC ACCCGTGTGA TCTTTAATGT CGGCGATGTG GTGGATAGTC AGCAAGGCTG GTCGTTACAA GTTGAACAAG TGGTGGAAGA CCAAGGCGTC TACACCTATC TAGGCACCCG AGTGGATACC GAAGAAAGCG GTGTCGCGCT GCGTGAAATT TTCCTCAGTA ACCAAATTCG TTTTAATAAA CCGCAGGACA AGTTGTTTGC CGGTCAAATC GATCGCATGG ACAACTTCGT GCTGCGTTAT CGCGCTCTAG CCAATCAATA TCAACAGCAC AAAAGCCCAA TGCGTGGCTT GTGTGGTATG CGCGCTGGGC TGATCCCGCA TCAGTTGTAC ATTGCCCATG AAGTCGGTCG CCGCCATGCG CCACGCGTAT TGTTAGCCGA TGAAGTCGGT TTGGGTAAAA CCATCGAAGC GGGCATGATC ATCCATCAGC AAGTGCTGAC CGGGCGTGCC GAGCGCATCC TGATTGTGGT GCCTGAAACG CTGCAACACC AATGGTTGGT GGAGATGATG CGCCGTTTCA ACCTGCACTT TTCGATTTTT GATGAAGAGC GCTGTGTGGA AGCCTTTAGC GAGGCCGATA ACCCATTTGA AACTCAGCAG TATGTGCTGT GTTCACTCGA TTTCTTGCGT AAAAGCCGCC AGCGTTTTGA ACAAGCCTTG GAAGCGGAAT GGGATTTATT GGTCGTCGAT GAAGCTCACC ATCTTGAGTG GCACCCAGAA AAGCCAAGCC GCGAATATCA AGTGATTGAA GCTCTGGCCG AACAGACTCC GGGCGTACTG CTATTGACGG CAACGCCTGA GCAGTTAGGT CGTGAAAGCC ACTTTGCTCG TCTGCGCCTG CTGGATGCAG ATCGTTTCTA CGATTACGAA GCCTTTGTTA AAGAAGAAGA GCAATACGCA CCCGTTGCCG ATGCCGTTAC TGCGCTGTTC AGCGGTGAGA AGCTGAGTGA TGAAGCGAAA AACAAAATCA CTGAGCTATT GTCTGAGCAA GATGTGGAGC CGCTGTTTAA AGCGTTGGAA AGCCACGCCA GCGAGGACGA AATTGCTTTG GCGCGCCAAG AGCTGATCGA CAATCTGATG GATCGCCACG GTACTGGGCG TGTGTTGTTC CGTAACACGC GTGCGGCAAT CAAAGGCTTC CCAGTGCGTA ATGTGCATTT GCTGCCATTG GAGATCCCTT CTCAATACAC CACGTCAATG CGTGTTGCGG GCATGCTCGG CGGTAAACTG ACGCCAGAAG CGCGCGCGAT GAAAATGCTC TACCCGGAAG AGATTTTCCA AGAGTTTGAA GGTGACGAAT CAAGCTGGTG GCAATTTGAC TCACGCGTTA ACTGGTTGCT CGAAAAAGTC AAAGCCAAAC GCAGCGAGAA GATCCTCGTC ATCGCTTCAC GTGCCAGCAC CGCGTTGCAG CTAGAGCAGG CCTTGCGTGA ACGTGAAGGC ATTCGCGCAA CGGTATTTCA TGAAGGCATG TCGATTATTG AGCGTGACAA AGCCGCTGCT TACTTTGCCC AAGAAGAGGG CGGTGCGCAG GTGTTGATCT GTAGTGAAAT CGGCTCCGAA GGTCGTAACT TCCAGTTTGC GAACCAATTG GTGATGTTTG ATCTGCCGTT CAACCCAGAC TTGCTGGAGC AGCGTATCGG GCGTTTGGAC CGTATCGGCC AAAAGCGTGA TATCGATGTG TACGTGCCGT ATTTGACCGA AACGTCACAA GCGATTTTGG CGCGTTGGTT TCAAGAAGGT TTGAATGCCT TTGCGGAAAC CTGTCCAACG GGTCGTGCGG TGTACGATGC CTTCGCTGAG CGTTTAATCC CAATTCTGGC CGCTGGTGGC GGTGAAGAGC TGGAAGTGAT CATCGAAGAG TCAGCCAAGC TCAACAAAAC ACTGAAATCG CAGCTGGAAG TCGGGCGTGA TCGCTTGTTG GAAATGCATT CTAACGGTGG CGAAAAAGCG CAGCAGATTG CCGAGCAGAT CGCGAAAACC GATGGCGATA CCAATCTGGT GACGTTTGCC TTGAGCCTGT TTGATGCGAT TGGTCTGCAT CAGGAAGATC GTGGCGAGAA TGCTCTAGTG GTCACTCCTG CCGAACACAT GATGGTGCCA AGCTATCCGG GCCTCCCTTA TGAAGGCGCC ACCATTACCT TTGATCGTGA CACCGCACTG TCGCGTGAAG ATATGCACTT CATCAGTTGG GAACACCCCA TGGTGCAAGG TGGCATTGAT TTACTGATGA GTGAAGGGGT GGGCACTTGC GCGGTGTCGC TGTTGAAAAA CAAAGCACTG CCAGTGGGTA CTATCTTGCT GGAGCTGGTG TATGTGGTGG ATGCCCAAGC GCCGAAACGC AGTGGCATCA GCCGCTTCTT GCCTGTCTCG CCAATCCGAA TTCTGATGGA TGCGCGCGGT AATGATCTCT CTTCGCAAGT GGAGTTTGAA AGCTTTAACC GTCAGCTCAG CCCAGTGAAT CGTCATTTGG CCAGCAAGCT GGTGAGTTCA GTACAGCATG ACGTGCATCG TTTGATTACG GCAAGTGAAG CGGCGGTTGA ACCGCGAGTG AGCGCGATTC GTGAACAAGC GCAGCGTGAT ATGCAGCAGA GCCTTAACAG CGAGTTGGAG CGTTTGCTGG CACTCAAAGC GGTTAACCCG AACATTCGTG ATGAAGAGAT CGAAGTGCTG GAGCAGCAAA TCAAAGAGTT GACTGGCTAT ATTGCGCAGG CGCAGTATCA GCTGGATTCA CTGCGTTTGA TTGTGGTGGC ACACAACTGA
|
Protein sequence | MSFALGQRWI SDTESDLGLG TVVALDARTV TLMFAASEEN RVYARSDAPV TRVIFNVGDV VDSQQGWSLQ VEQVVEDQGV YTYLGTRVDT EESGVALREI FLSNQIRFNK PQDKLFAGQI DRMDNFVLRY RALANQYQQH KSPMRGLCGM RAGLIPHQLY IAHEVGRRHA PRVLLADEVG LGKTIEAGMI IHQQVLTGRA ERILIVVPET LQHQWLVEMM RRFNLHFSIF DEERCVEAFS EADNPFETQQ YVLCSLDFLR KSRQRFEQAL EAEWDLLVVD EAHHLEWHPE KPSREYQVIE ALAEQTPGVL LLTATPEQLG RESHFARLRL LDADRFYDYE AFVKEEEQYA PVADAVTALF SGEKLSDEAK NKITELLSEQ DVEPLFKALE SHASEDEIAL ARQELIDNLM DRHGTGRVLF RNTRAAIKGF PVRNVHLLPL EIPSQYTTSM RVAGMLGGKL TPEARAMKML YPEEIFQEFE GDESSWWQFD SRVNWLLEKV KAKRSEKILV IASRASTALQ LEQALREREG IRATVFHEGM SIIERDKAAA YFAQEEGGAQ VLICSEIGSE GRNFQFANQL VMFDLPFNPD LLEQRIGRLD RIGQKRDIDV YVPYLTETSQ AILARWFQEG LNAFAETCPT GRAVYDAFAE RLIPILAAGG GEELEVIIEE SAKLNKTLKS QLEVGRDRLL EMHSNGGEKA QQIAEQIAKT DGDTNLVTFA LSLFDAIGLH QEDRGENALV VTPAEHMMVP SYPGLPYEGA TITFDRDTAL SREDMHFISW EHPMVQGGID LLMSEGVGTC AVSLLKNKAL PVGTILLELV YVVDAQAPKR SGISRFLPVS PIRILMDARG NDLSSQVEFE SFNRQLSPVN RHLASKLVSS VQHDVHRLIT ASEAAVEPRV SAIREQAQRD MQQSLNSELE RLLALKAVNP NIRDEEIEVL EQQIKELTGY IAQAQYQLDS LRLIVVAHN
|
| |