Gene Information |
Locus tag | Gdia_0422 |
Symbol | |
ID | 6973816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 462365 |
End bp | 463324 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643389954 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002274833 |
Protein GI | 209542604 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
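The coordinate and length fields in the record above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(462365, 463324)
print(length_bp, protein_length(length_bp))  # prints: 960 319
```

Under these assumptions the computed values match the Gene Length (960 bp) and Protein Length (319 aa) fields.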
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0368078 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0376622 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCGTC CCATGCCGGG CCTGCCGCCA CCGCGACCGA TTCCGCTGAA GGAACGTTCT TCTCTCCTCT TTGTCGAAAA AGGGCAGATC GACGTTCTCG ACGGTGCCTT CGTGCTGGTC GGCCATAACG GCATTCGCAC GCATATCCCG ATCGGCGGCC TCGCCTGCCT TATGCTGGAA CCGGGTACCC GCGTCAGCCA TGCCGCGGCG GCGCTCGCGG CCCGTGCCGG CACGCTGCTG ATCTGGGTGG GGGAAGCGGG GGTCAGGCTT TACGCCTCGG GCCAGCCAGG GGGGGCGCGG GCGGACAGGC TTTTGCTTCA GGCCCGACTG GCCCTGGATG ATGCCGCCCG CCTGAAGGTC GTTCGCAAGA TGTACGCGCT TCGTTTCGGC GAGGATGCGC CGGAACGGCG TTCGGTGGAC CAGTTGCGGG GTATCGAGGG ATCGCGGGTG CGCGAGACCT ATCGCCTGCT GGCCCGCACG CACGGGGCGG AGTGGGACGG GCGGCGATAC GATCCGCACG AATGGGATAC CGCCAATCTG GTCAATCGCT GTCTATCAGC AGCCACAGCC GCGTTGTACG GGATTACCGA AGCGGCGATT CTGGCTGCGG GTTACGCGCC GGCCATAGGG TTCCTGCATA CGGGAAAGCC GCAGAGCTTT GTGTACGACA TCGCCGATAT CGTGAAGTTC GAGACTGTCG TGCCCGAGGC TTTCCGGGTG GCAGCGGCCG TCCAGCATGA AAGACCTTTG GATGGACAGC GGATTTCGGA CCCTGTGGCA GCCGTTCGCC ACCGATGCCG CGACCAGTTC CGACGTACGA ACATTCTGGG ACGGCTTATC CCGCTGATCG AAGAGATCCT TTCGGCAGGC GGTGCCGAGA TGCCTCCGGC GCAGGATGAG GCCATGCCCG TGGCGTTCAA GGACGCGAAA GGGCTGGGCG ATGCTGGTCA TCGTGGTTGA
|
Protein sequence | MSRPMPGLPP PRPIPLKERS SLLFVEKGQI DVLDGAFVLV GHNGIRTHIP IGGLACLMLE PGTRVSHAAA ALAARAGTLL IWVGEAGVRL YASGQPGGAR ADRLLLQARL ALDDAARLKV VRKMYALRFG EDAPERRSVD QLRGIEGSRV RETYRLLART HGAEWDGRRY DPHEWDTANL VNRCLSAATA ALYGITEAAI LAAGYAPAIG FLHTGKPQSF VYDIADIVKF ETVVPEAFRV AAAVQHERPL DGQRISDPVA AVRHRCRDQF RRTNILGRLI PLIEEILSAG GAEMPPAQDE AMPVAFKDAK GLGDAGHRG
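The relationship between the gene sequence and the protein sequence can be sketched in plain Python. The amino-acid string below is the codon assignment commonly listed for NCBI translation table 11 (bases in TCAG order); table 11 also allows alternative start codons, which this sketch ignores because the gene here begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the sequence is taken as printed, in coding orientation, even though the record lists the gene on the minus strand.

```python
from itertools import product

BASES = "TCAG"
# Codon assignments for translation table 11, first base varying slowest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last three blocks of the gene sequence as printed in the record.
print(translate("ATGTCTCGTC CCATGCCGGG CCTGCCGCCA"))  # prints: MSRPMPGLPP
print(translate("GGGCTGGGCG ATGCTGGTCA TCGTGGTTGA"))  # prints: GLGDAGHRG
```

The two fragments reproduce the first and last residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be checked the same way.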