Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0427 |
Symbol | |
ID | 6973821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 466452 |
End bp | 468092 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643389959 |
Product | CRISPR-associated protein, Cse1 family |
Protein accession | YP_002274838 |
Protein GI | 209542609 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00538764 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.144998 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTTC TGACCGCTTC ATGGCTGCCC ATTCGCCGGA AATCCGGCGC GGCGGAGACC ATTCGTCCGG CGCAGATCGT GGACCGCGTG GCCGACGATC CGATCATGGC GCTCGATTGG CCACGGGCGG ATTTCCGCAT CGCGTCCCTT GAATTCCTGA TCGGATTGCT GGCAACGGCG TTTCCGCCCA AAAACGAGGA TATCTGGTGC GAAACCTGGG AAGACCCACC TTCCGTGGAG GCGCTGGACG AAGCCTTCGC CCCTGTGGCC GAAGCCTTCT GGCTCGATGG GCCGGGGCCA CGCTTCCTGC AGGACCTGGA AAATCTCCAG AGCGGCCAGG AGCCTGTGGA ACGTCTGCTG ATCGATGCGC CTGGCGATAG CACCGTAAAG AAGAACACGG ATCTGTTCGT GCATCGGCAG CGGATCATGG CGCTGGGGCG TCCGGCCGCC TGCATGGCCC TGTTTACCCT CCAGTCCTGG GCGCCGAGCG GCGGTGCGGG AAACATGACC GGCCTGCGGG GCGGCGGCCC CCTGGTCACG CTGGTCCTGC CCCGGGAGGG AGCGAGCCTG TGGGAGATGG TATGGGCGAA TACGCCCTTC GGGGTGCCTC CGTCCGAAGC AGACCTGCCC CGTGTGTTCC CGTGGCTCGC GCCGACAATC GGGTCAGGCA AGGATGGAAC CAGTGTACGC CCAGGCCATA ACGCCCATCC GCTGCAATGC TGGTGGGGAA TGCCGCGGCG TATTCGCCTC GATTTCGAAG CGGCGGAGGA CGGCATATGC GACCTGACGG GGCAGCCGGA TGCCGTTCTG GTACCGGGCT GGCGACAGCG TCCTTATGGC GCAAGCTACG CCGACTGGAC AGGGATGCCC TATGGCGCGG GCGCATCCAT CCACCCGTTG ACGCCCCGTT ATCGCCAGAA GAAAGACGCC GAATGGCTGA GCGTCCACCC GCAGCCGGGT GGAATCGGAT ACCGCCATTG GGCCGGGATT GTCGTGAACA GCAGCGATAC GCACCGGCTT CCGGCAAGCA CGGTGCTGTC ATGGCGTAAC GACCGGGCCC GGAACGTTGC CGCCTCGCTC ACGCCTCGGC TTCTGGCGGC CGGTTATGAC ATGGACAATA TGAAGGCCCG CAGCTTCGTC GAGAGCGAGA TGCCGCTGCC CGGCGTGGTC GATCCGGTGC GTCAGGAGGC GCTTGACGCA CTGGCCCGAG CGTATGTCGA GGCCGCCGAC CAAGTGGCGG GTATCCTCCG ACAGTGTGTC CGGGAGGCCC TGTTTGGCAA GGGGACCATC TCGCCAGATG CAACGCTGTT CTCCGGCCTG CGCGAACGGT TCTGGGCGCA GACGGAGGGA ACGTTCTTCG ATCTTCTCCA TCAGGCGGTG CTGCTGGGTG ACGGAGATGA CATCGATCTG CGGCGCATCT GGCTGCGCGC CCTTCGGCGG GTGGCCCTGG ACCTGTTCGA CTCCGCCGTC ATGCTGACTC CGGACACGGG AACGACCGAG GCGCAACGAT CCGCGCTGGC GCGCCGGCGG CTCGGGGCGG CTGTCGCGGG CGGTGGAAAG GAAGGGCAGG CGCTCATGAT GGTTCTGAAA ATTCCCGTGC CCGTTACACG GAAAGCGACG TCCAGGATGA AGCAGCCATG A
|
Protein sequence | MNLLTASWLP IRRKSGAAET IRPAQIVDRV ADDPIMALDW PRADFRIASL EFLIGLLATA FPPKNEDIWC ETWEDPPSVE ALDEAFAPVA EAFWLDGPGP RFLQDLENLQ SGQEPVERLL IDAPGDSTVK KNTDLFVHRQ RIMALGRPAA CMALFTLQSW APSGGAGNMT GLRGGGPLVT LVLPREGASL WEMVWANTPF GVPPSEADLP RVFPWLAPTI GSGKDGTSVR PGHNAHPLQC WWGMPRRIRL DFEAAEDGIC DLTGQPDAVL VPGWRQRPYG ASYADWTGMP YGAGASIHPL TPRYRQKKDA EWLSVHPQPG GIGYRHWAGI VVNSSDTHRL PASTVLSWRN DRARNVAASL TPRLLAAGYD MDNMKARSFV ESEMPLPGVV DPVRQEALDA LARAYVEAAD QVAGILRQCV REALFGKGTI SPDATLFSGL RERFWAQTEG TFFDLLHQAV LLGDGDDIDL RRIWLRALRR VALDLFDSAV MLTPDTGTTE AQRSALARRR LGAAVAGGGK EGQALMMVLK IPVPVTRKAT SRMKQP
|
| |