Gene Information |
Locus tag | GSU0051 |
Symbol | |
ID | 2688418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 65570 |
End bp | 68335 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637124716 |
Product | CRISPR-associated HD domain-containing protein |
Protein accession | NP_951113 |
Protein GI | 39995162 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR02621] CRISPR-associated helicase Cas3, Anaes-subtype |
|
|
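The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = cds_lengths(65570, 68335)
print(gene_len, protein_len)
```

With the Start bp and End bp values listed here, this gives 2766 bp and 921 aa, matching the Gene Length and Protein Length fields.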
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGAAC GGGGGATAAC TATGTCGACA TCTGAAGATT TTGAATCAAA CTACAAACTC CTAACAGGTA ATAGTCCTTT CCCATGGCAG CGGAAGCTAT TCACCATGTT TGTAGACAAA CGGTTTCCAG AAACTTGCCC TGTCCCGACT GGCCTCGGGA AAACTTCAAT CATCGCAATC TGGCTTCTGG CAGTGGCCCA CCATACCCGT AACGGCACAG TTACTGAATT TCCGCGGCGC CTCGTCTATG TGGTGAACCG GAGAACCGTG GTGGACCAGG CTACGGATGA AGCCAGAAAA ATGCTTGAAG CACTCATTAC CAAGCCAGAA CTCTGTGCTG TTGCTGATGT GCTGAGTTCG CTCGCGACCC GGGCTGATGC AGCACCTTTG GCGATCAGCA CGCTACGAGG ACAGTTCGCT GACAACGCGG AGTGGCGGGA CGACCCGGCC CGGTCTGCCG TGATCGTCGG CACGGTGGAT ATGATTGGCA GTCGATTGCT CTTTTCCGGT TACGGGTGTG GGTTCAAGTC CCGTCCGTTG CATGCTGGAT TTCTCGGACA GGACACCTTA CTGGTTCATG ACGAGGCCCA CCTGGAGCCC GCATTTCAGG AGCTGGTATC GGCCATTGCA TCGGAACAGC AGCGCCGCCG CGACTTCTGG CAGTTACGAG TCATGGAACT AACGGCAACT TCTCGTTCTG ACATTACTGA CGCGAATACT CTCTTTACAG GGGAAGACCG GAAACATGAT GTGGCCCGGA AACGACTCGA AGCGAAAAAA GGGATTTCTT TTCATCCGGT TGACGATGAA AAGAGGGTTA CTGAGAAGGT TGAACAACTG ACCAAAGGGT ACAAAGACAG TGGGCAGGCA ATTCTCGTCT TTCTTCGCGG GGTCAAAGAG GTGATGAAGG TAGCGGCATT TCTGCGCAAG GTGGTGCCGG ATGGTGTAGA GACCCTGACC GGCACCCTGC GCGGCTTCGA GCGCGACACG TTGGCCAGGG AAAATGCCGT CTTTGCCCGC TTCATGCCTC ACTCCGAAGC GATACCACAG CAGGGCACCG TCTATCTCGT GTGTACCTCC GCCGGAGAAG TCGGGGTCAA CATATCAGCG GATCATCTTG TCTGCGACCT GACCCCCTTC GAAAGCATGA TGCAACGCTT TGGACGGGTG AATCGATTTG GTGATGGCAA TGCACATATC GACGTCGTTC ATCGCCCATT CGGAAAGGGC TCTGGTTTCG ACGAATCTGC GCCCGCCGCA TTGGACGATG ATCCGTCAGC GGGGGCCGCC AGCCAGGACG ATCTAACTGC CGACGCGGCG CCCAAGAAGA AAGAGAAACC GTTGTCACCT TTTGATCGGG CTTGCGCACA AACGCTGTCC GTGTTGAAGC AACTGCCCTT GCGTGAAGAC CGGCGTCGCG ATGCGAGCCC TGCGGCACTA AGCGAAATAC CCCTTGCCGA CCGGCAAGCT GCTTTCACGC CACCTCCCGT CATTCTTCCG ATAAGTGACA TTCTTTTCGA TGTCTGGGCG CTGACCTCGG TTCGCCAGAA ACTGCCCGGC CGACCGCCAG TTGCCGATTG GCTGCATGGC GTTACCGAAT GGGAGCCGCC TGCAACCCAT GTTGCATGGC GGGATGAGGT CTATCTGGTC AGCGGCATGT TGGAGATCTG TCCGCCCGAA GACCTGCTTG AAGACTATCC CCTCAAGCCT CACGAATTGT TGCGCGACCA GACCACACGG GTGTTTTCCG AGCTTGAGAA AATCGCCAGC CGATGTCCGG CAAAGCCTGT ATGGCTTCTC 
AACGCTGACG GCAAGGTTGA CGTGCTTCCC TTGGAGAAAT TAGTCGATAG GGACAAGCTG AGACTGGCCG ACTGTACTGT GTTGCTGCCG CCAGCGGTTG GTGGCCTCGA CGGAGGCATG CTCAACGGTG ATGCCGCTTA TGACCCCAAG GTGGTTTATG ACATAGCAGA TCAATGGTTG GGCGAGAACG ACAGGCCGCG TCGTTGCCGC CTTTGGGACA ACGAGGGCCC TTTATCAGGA ATGCGGCTTG TGCGAACTAT CGACGTGCGC CCCGATGCCG AAGAACAGGA GGAAGAAAGC GCAGAATCAA CTATACGGCG GTGCTGGCAT TGGTATGTAC GGCCCCGCTC CGCTGACGAT GATGGCTCAA GGACGGCACG AGTAAATCAG GAACTGACAT CCCACCTTCG GTCTGCCGAA GACTTCGCCA AGAAGCTCGT CGACAGGCTC GGCTTGGGCG CGCTGGAAGC AAAGGCGGTA GTCTTGGCAG CGCGCTGGCA TGACCTCGGC AAAGATCGCC TGATCTGGCA GCGCTCCATC GGTAATCGTG ACTACCCTGG GCTTGTCCTG GCGAAGTCCG GGTCCGGAAT GCGGCCTATC GATCTCAGCG ACTATCGGCA TGAGCTCGGA TCACTTATCG ACATATCCAA CTACCCGGAG TTTCTTGAAT TGTCTGAGGA GTTACAAGAT CTCGTTCTGC ATCTGGTCGC AGCCCATCAT GGCCGGGCTC GGCCCCACTT CCCGGAAAAG GAAACGTTTG ATTATGACCG GTCAGAAGAG GCCGTAGCGG CTATCGTCAG TGAGGTCCCC CGCCGTTATG GCCGGTTGCA GCGGAAATAC GGCCGGTGGG GGCTTGCCTA TCTTGAATCC TTGGTACGTG CTGCCGATGC CATGGCGTCA CAATATGTCG CTGGTGAAGA CCCCGCTTAT GGCAATGCTG CGTTGTCCCT GGGAGGTGCC CAATGA
|
Protein sequence | MVERGITMST SEDFESNYKL LTGNSPFPWQ RKLFTMFVDK RFPETCPVPT GLGKTSIIAI WLLAVAHHTR NGTVTEFPRR LVYVVNRRTV VDQATDEARK MLEALITKPE LCAVADVLSS LATRADAAPL AISTLRGQFA DNAEWRDDPA RSAVIVGTVD MIGSRLLFSG YGCGFKSRPL HAGFLGQDTL LVHDEAHLEP AFQELVSAIA SEQQRRRDFW QLRVMELTAT SRSDITDANT LFTGEDRKHD VARKRLEAKK GISFHPVDDE KRVTEKVEQL TKGYKDSGQA ILVFLRGVKE VMKVAAFLRK VVPDGVETLT GTLRGFERDT LARENAVFAR FMPHSEAIPQ QGTVYLVCTS AGEVGVNISA DHLVCDLTPF ESMMQRFGRV NRFGDGNAHI DVVHRPFGKG SGFDESAPAA LDDDPSAGAA SQDDLTADAA PKKKEKPLSP FDRACAQTLS VLKQLPLRED RRRDASPAAL SEIPLADRQA AFTPPPVILP ISDILFDVWA LTSVRQKLPG RPPVADWLHG VTEWEPPATH VAWRDEVYLV SGMLEICPPE DLLEDYPLKP HELLRDQTTR VFSELEKIAS RCPAKPVWLL NADGKVDVLP LEKLVDRDKL RLADCTVLLP PAVGGLDGGM LNGDAAYDPK VVYDIADQWL GENDRPRRCR LWDNEGPLSG MRLVRTIDVR PDAEEQEEES AESTIRRCWH WYVRPRSADD DGSRTARVNQ ELTSHLRSAE DFAKKLVDRL GLGALEAKAV VLAARWHDLG KDRLIWQRSI GNRDYPGLVL AKSGSGMRPI DLSDYRHELG SLIDISNYPE FLELSEELQD LVLHLVAAHH GRARPHFPEK ETFDYDRSEE AVAAIVSEVP RRYGRLQRKY GRWGLAYLES LVRAADAMAS QYVAGEDPAY GNAALSLGGA Q
|
| |
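The gene and protein sequences above are related through translation table 11, and the GC content field is a property of the gene sequence. The following is a minimal sketch, using only the Python standard library, of how both could be recomputed from the space-grouped sequence text. The names `CODON_TABLE`, `clean`, `gc_content`, and `translate` are illustrative assumptions, not part of the source database. Table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons; since this gene starts with ATG, no special start handling is needed here.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence in this record.
print(translate("ATGGTTGAAC GGGGGATAAC TATGTCGACA"))
```

Applied to the first three blocks of the gene sequence, `translate` returns `MVERGITMST`, which is the first block of the protein sequence. Passing the complete gene sequence to `gc_content` would allow the GC content field to be checked in the same way.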