Gene Information |
Locus tag | HMPREF0424_0759 |
Symbol | cas3 |
ID | 8709870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 860091 |
End bp | 862937 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 646482861 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_003373983 |
Protein GI | 283783229 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
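
The gene and protein lengths above can be cross-checked from the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (consistent with "Is gene spliced: No"). The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(860091, 862937)
print(length)                     # 2847
print(protein_length_aa(length))  # 948
```

Both values agree with the Gene Length (2847 bp) and Protein Length (948 aa) fields in the record.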
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0254151 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTTGTA ATCACGTTGT TAATTCTGCA TTATGGGGTA AAAAGAGAGA AGCTAATGGA GTAATGCAAT GGCTTCCTCT TGCTCAACAT CTTGAGGATA CTAGAAATGT TATTGGTCAG CTGTGGGAGC ATTGGCTTAG TGGTGGTCAG AGAAGATTGA TTGAATCCTC ATTAAGCAAA CGAGTTGATG CAAAAAAATT GTCTCAATTT TTAGGCTGTG TTCATGATAT TGGTAAAGCT ACTCCTGTTT TTCAGTTTCG TAAATCGCCT TCAAATTCAA AAGATTTAGA TATTGCGTTA AAAAATAAGT TAGCTACTGT TGGTTTTACA AATATTGATT ATTTTATTGA TACGACAGAA GGTAGTCATA ATAGTCATCA CACTATAACT GGTCAATTTA TTCTTTCTAA TGCTGGAGTT CCTGAAGGGA TTTGTGCAAT TGTTGGAGCT CATCATGGTA AGCCTTTAAA TAATGATTCT GTTTGTAGAA GTAATAAGTC TAAATATCCT GACCATTACT ATCAAAGTGA AACTGAGAGT GAAAATAGTA AATTATGGAA AAAGTTACAG AATGATATTT TGGATTGGGC TTTAGAAAGA AATGATTTTT CTAATGTAAA TGATCTTCCT GAAATAAGTG AACCTGCGCA AGTTTTGTTG TGCGGATTAG TAATTATGGC TGATTGGATT GCTAGTAATG AACACTATTT TCCGCTTATT TCAATAGAGC AAGATTTGAT TGAAAATCAG GAAGAAAGAT ATAGAAAAGG TTGGGAAAAC TGGTTACAGC ATGGTTCTAA AGACGTATGG GAATCGTTAA ATTGTTGTAG CAATGTTTCG CAAACATACA AATATCGTTT TGGTTTCTTT CCAAATAATA TTCAGATAGC GTTGCATGAT GTAATTTCTC AGTCTAAAGA GCCTGGAATT TTTATTCTTG AATCTGCAAT GGGTTCAGGT AAAACAGAAG CTTCTTTAAT TGCTGCTGAG CAATTGGCAA ATCTAACTGG AAGAAGCGGA GTGTTCTTCG GTCTTCCTAC TCAAGCCACT TCTAACGGAA TGTTTAGACG AGTTGAGGAT TGGCTTAAAA ATGTGAATAG TGATTTCCAG GGCGAAATTG GTTTACGTTT AGTTCATGGT AAAGCTGAAC TGAATGCAGA TTATGCGCAT TTGCAGCATG GAATGCAAAA TATGAATGAT GGTTGTGAAA GTACTTCTAA TAGTAATGAT GTAAATAATA ATGGAGTAAT ACTTAATGAT TGGTTTACGG GTCGAAAAAC CGCAATGTTG GATGACTTTG TGGTTGGTAC AGTAGATCAA TTTTTGTTAG CTTCTCTAAA GCAAAAACAT CTTATGCTCC GTCATTTGGG TTTGAGTAAA AAAGTTGTAA TTATTGATGA AGTGCATGCT TATGACGCTT ATATGAATAA GTATCTTGAA GAATCTCTTA TTTGGATGGC GGCATACGGT GTTCCTGTAG TGTTACTTTC TGCTACTTTA CCTGCTAAAC GTAGAAAAGA ACTTATAAAA GCTTATATGT GTGGATTGTT TGGGTTTAAT TGGAGAGAAT GTGATAAATC TAATGTTGAT TTTGAAACAA ATAATTATCC TTTAATTACA TACAGTGATA AAAACTGTGT AAAACAAAAA TTTATTGAAA ATGATGCTAG TGACAATAAA TCTGTTTCTG TAAGGAAAAT AACAGACGAT AACTTGCATG AATCACTTGT TGGTGAGCTT AAATCTTTGC TGAATAATGG TGGTATTGCT GGAATTATAG TGAATACCGT TAAAAGAGCT 
CAGGAAATTT ATAATGCGTG TGTTGATGAG TTTAGTGATG ACGAAGTTAT AGTTATTCAT TCGCAATTTA TCGCTACAGA TAGAGTTAGA AAAGAACAGC AAATTTGCAA TATGATTGGC AAGAATGCTC ATAGACCTGC TCGTGCAATA ATTATTGGTA CGCAAGTATT AGAGCAGTCG CTTGATATTG ATTTTGATGT TTTGTTTACT GATCTTGCTC CTATAGATTT ATTACTTCAA CGTGCTGGCA GATTACACAG GCATACGATA GAACGCTCAG AAACTTTCGC AGAGCCAATT TTATATGTGT TGGGAACCAG TGATCGTTAT GAATTTGATA AAGGTTCAGA GTCTATTTAC AGCAAATATT TACTTATGAG AACTCAATAC TATTTGCCAA ATGTTATAAA CATGTCTCAT GATATTTCTC GATTAGTGCA AATAGTTTAT GGCGATAATC CTTTAGAGTT ACAAGAAGAT TTGAAAGATG TGTATGCTGT TGCAAAAAGA GAACATGATT CAGTAAGAAA TAGTAATGAA AGTGCTGCAA AAACATATAG AATTGAAAAT CCAGAATCAG AAATTGGTGA AAAATCTATT GTTGGCTTGT TGACAAATTC AATTACAAAT GAATCTGATG AATTTGCATG TGCTCAAGTT CGTAATAGTG GTGAATCTAT AGAAGTTATT GCTGTTAAAA GAGTTGGATC CGGCTATGGA ACTTTACATG ATTGTAAAGA TATTTCACAA AATATTGATG ATGTGGAAGT TGCAATGAAA CTTGCTCAAG AGACAGTAGG ATTGCCGTGG ATGTTTACAT TAAACAGCGA TCGTGTTGAT GAAACAATTG CAGAATTGGA AAGAATACGA AAACAAAATC AATTCAAAAA TTGGGATAAT CAGCCTTGGT TGCGAGGTTC TTTGGTGCTT CTATTCGATG AAAATAATAT TTGCGAATTG TCAAAATACA GAGTTGTTTA TTCTGAAAAA AGTGGTATTG TATGTATTAA GAGTTCGGAA GAAGACAGAA AGGAATTAAG AAGGTGA
Protein sequence | MSCNHVVNSA LWGKKREANG VMQWLPLAQH LEDTRNVIGQ LWEHWLSGGQ RRLIESSLSK RVDAKKLSQF LGCVHDIGKA TPVFQFRKSP SNSKDLDIAL KNKLATVGFT NIDYFIDTTE GSHNSHHTIT GQFILSNAGV PEGICAIVGA HHGKPLNNDS VCRSNKSKYP DHYYQSETES ENSKLWKKLQ NDILDWALER NDFSNVNDLP EISEPAQVLL CGLVIMADWI ASNEHYFPLI SIEQDLIENQ EERYRKGWEN WLQHGSKDVW ESLNCCSNVS QTYKYRFGFF PNNIQIALHD VISQSKEPGI FILESAMGSG KTEASLIAAE QLANLTGRSG VFFGLPTQAT SNGMFRRVED WLKNVNSDFQ GEIGLRLVHG KAELNADYAH LQHGMQNMND GCESTSNSND VNNNGVILND WFTGRKTAML DDFVVGTVDQ FLLASLKQKH LMLRHLGLSK KVVIIDEVHA YDAYMNKYLE ESLIWMAAYG VPVVLLSATL PAKRRKELIK AYMCGLFGFN WRECDKSNVD FETNNYPLIT YSDKNCVKQK FIENDASDNK SVSVRKITDD NLHESLVGEL KSLLNNGGIA GIIVNTVKRA QEIYNACVDE FSDDEVIVIH SQFIATDRVR KEQQICNMIG KNAHRPARAI IIGTQVLEQS LDIDFDVLFT DLAPIDLLLQ RAGRLHRHTI ERSETFAEPI LYVLGTSDRY EFDKGSESIY SKYLLMRTQY YLPNVINMSH DISRLVQIVY GDNPLELQED LKDVYAVAKR EHDSVRNSNE SAAKTYRIEN PESEIGEKSI VGLLTNSITN ESDEFACAQV RNSGESIEVI AVKRVGSGYG TLHDCKDISQ NIDDVEVAMK LAQETVGLPW MFTLNSDRVD ETIAELERIR KQNQFKNWDN QPWLRGSLVL LFDENNICEL SKYRVVYSEK SGIVCIKSSE EDRKELRR
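
The relationship between the two sequences can be illustrated by translating the first 30 and last 12 nucleotides of the gene sequence. This is a sketch only: it uses the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in the set of permitted start codons, which is not modelled here). The `translate` helper and the variable names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino acid assignments
# are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-nt blocks and final 12 nt of the gene sequence above.
head = "ATGAGTTGTA ATCACGTTGT TAATTCTGCA"
tail = "TTAAGAAGGTGA"

print(translate(head))  # MSCNHVVNSA
print(translate(tail))  # LRR*
```

The translated fragments match the start (MSCNHVVNSA) and end (LRR) of the protein sequence, with the terminal TGA codon read as a stop.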