Gene Information |
Locus tag | Gdia_0342 |
Symbol | |
ID | 6973736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 382737 |
End bp | 385748 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643389874 |
Product | CRISPR-associated protein, Csn1 family |
Protein accession | YP_002274753 |
Protein GI | 209542524 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01865] CRISPR-associated protein, Csn1 family |
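The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence does end in TGA). The function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Gdia_0342.
length = gene_length(382737, 385748)
print(length)                  # 3012
print(protein_length(length))  # 1003
```

Both results match the Gene Length (3012 bp) and Protein Length (1003 aa) fields reported in the record.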
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0739339 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0465454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACG AAAGCCTGAC ATTCGGGATC GACTTGGGCA TCGGTTCATG CGGCTGGGCC GTACTGCGGC GGCCGTCTGC CTTCGGAAGA AAAGGCGTGA TAGAAGGAAT GGGGAGTTGG TGCTTCGATG TTCCCGAAAC CAGCAAGGAA CGGACGCCCA CCAACCAGAT TCGGCGTTCC AACCGACTGC TACGGCGGGT AATCCGCCGC CGCCGCAACC GTATGGCCGC AATCAGGCGA TTACTCCACG CCGCCGGGCT GCTTCCCTCG ACCGACAGCG ATGCGTTGAA ACGCCCTGGC CACGATCCAT GGGAGTTGCG CGCGCGCGGC CTCGACAAAC CGTTGAAACC CGTCGAGTTC GCGGTCGTGC TTGGCCATAT CGCCAAAAGG CGGGGCTTCA AATCCGCCGC CAAGCGCAAG GCGACAAACA TCAGCAGCGA CGACAAGAAG ATGCTGACCG CACTGGAAGC CACTCGCGAG CGGCTGGGGC GCTACCGCAC GGTCGGCGAA ATGTTTGCGC GTGATCCTGA TTTTGCCAGC CGCCGCCGCA ACCGCGAAGG CAAATATGAC CGCACTACTG CCCGTGACGA CCTGGAGCAT GAGGTCCACG CCCTGTTCGC CGCGCAGCGC CGGCTGGGAC AGGGTTTCGC CTCGCCAGAA CTGGAAGAAG CGTTCACCGC CAGTGCCTTC CACCAGCGGC CGATGCAGGA CAGCGAAAGG CTGGTAGGTT TCTGCCCATT CGAAAGGACG GAGAAGCGCG CAGCGAAATT GACTCCGTCC TTTGAGCGAT TCCGCCTGCT GGCCCGGCTC CTCAACCTGC GCATTACGAC GCCCGATGGC GAACGCCCGT TGACCGTCGA TGAAATCGCT CTCGTTACCC GCGATCTCGG CAAGACTGCA AAGCTGTCGA TCAAGCGCGT GCGGACCCTG ATCGGTCTGG AGGACAATCA ACGTTTTACC ACGATCAGGC CGGAGGATGA GGATCGCGAT ATCGTTGCCC GGACCGGTGG GGCGATGACG GGAACCGCGA CCCTCCGCAA GGCTCTCGGC GAGGCGCTTT GGACCGACAT GCAAGAGAGG CCGGAGCAGC TTGACGCGAT TGTCCAGGTA CTCAGTTTCT TCGAGGCGAA CGAAACAATA ACGGAGAAAT TGCGTGAGAT CGGTCTGACG CTTGCGGTCC TCGACGTTTT ACTGACGGCG CTGGATGCTG GCGTATTCGC GAAATTCAAG GGCGCTGCCC ATATTTCGAC GAAAGCGGCG CGCAACCTGC TGCCCCATCT CGAACAAGGC CGACGCTATG ACGAAGCTTG CACGATGGCC GGCTATGACC ATGCTGCCTC CCGCCTTTCC CATCACGGTC AGATCGTCGC AAAGACCCAG TTCAACGCAC TGGTCACGGA AATCGGGGAA AGCATTGCCA ACCCTATCGC CCGCAAAGCC CTGATCGAAG GGCTCAAGCA GATCTGGGCG ATGCGTAATC ACTGGGGCTT GCCCGGTTCG ATCCATGTCG AACTTGCCCG CGATGTCGGC AACAGTATCG AAAAGCGACG GGAAATTGAA AAACATATTG AAAAAAATAC CGCCCTGCGC GCGCGCGAAC GTCGGGAGGT CCATGATCTT CTTGATCTGG AAGATGTGAA CGGCGACACA TTGCTGCGTT ACCGACTATG GAAAGAACAG GGAGGCAAAT GCCTGTATAC CGGCAAGGCT ATCCACATTC GTCAGATAGC AGCCACCGAC AACAGCGTCC AGGTGGATCA TATCCTGCCT TGGTCCCGCT TCGGCGATGA CAGCTTCAAC 
AACAAGACGC TGTGTCTTGC CAGTGCCAAC CAGCAAAAAA AGCGATCGAC GCCCTATGAA TGGCTCTCCG GACAGACTGG CGATGCGTGG AACGCTTTTG TACAGCGGAT CGAGACCAAC AAGGAACTGC GCGGCTTCAA GAAGCGCAAT TATCTGCTGA AAAACGCCAA AGAGGCTGAA GAGAAATTCC GCAGCCGCAA TCTCAATGAC ACGCGCTATG CCGCACGCCT GTTCGCGGAA GCAGTGAAAC TGCTTTATGC TTTTGGCGAG CGGCAGGAAA AGGGCGGTAA TCGTCGCGTC TTCACCCGGC CCGGCGCTCT TACGGCGGCT TTGCGTCAGG CATGGGGGGT GGAATCACTC AAGAAACAGG ATGGCAAGCG CATCAATGAT GACCGCCATC ACGCGCTGGA TGCGCTGACA GTGGCGGCAG TTGACGAAGC CGAGATCCAG CGGCTGACCA AATCTTTTCA CGAATGGGAA CAGCAGGGAC TGGGCCGACC ACTGCGACGT GTCGAACCGC CGTGGGAAAG CTTTCGTGCG GATGTTGAGG CAACCTATCC GGAAGTATTC GTTGCCCGCC CCGAACGCCG CCGCGCGCGT GGCGAGGGCC ATGCCGCGAC AATCCGGCAG GTGAAAGAGC GCGAATGTAC GCCCATCGTC TTTGAACGGA AGGCCGTTTC CAGCCTCAAG GAAGCCGACC TTGAACGGAT TAAGGATGGC GAACGCAATG AGGCCATCGT GGAGGCTATA CGCTCCTGGA TCGCAACCGG CCGGCCCGCC GACGCCCCAC CCCGTTCGCC ACGCGGCGAT ATTATCACCA AGATCCGTCT GGCGACCACG ATCAAGGCCG CCGTTCCCGT CCGCGGGGGC ACTGCCGGTC GAGGAGAAAT GGTACGGGCG GACGTGTTTA GCAAGCCCAA CCGCAGGGGC AAGGACGAAT GGTATCTGGT GCCCGTTTAT CCACATCAGA TCATGAATCG GAAGGCTTGG CCAAAGCCAC CGATGCGGTC GATAGTTGCC AATAAGGATG AGGATGAATG GACCGAAGTC GGTCCCGAGC ATCAATTTCG CTTTAGTCTT TATCCTCGCT CCAATATAGA GATCATAAGA CCGAGTGGAG AAGTGATCGA AGGATATTTC GTGGGCCTTC ATCGGAACAC GGGGGCACTA ATACCAACTC CGGTCGGCCC TGACTCATAT GTGATTGCCT GA
|
Protein sequence | MIDESLTFGI DLGIGSCGWA VLRRPSAFGR KGVIEGMGSW CFDVPETSKE RTPTNQIRRS NRLLRRVIRR RRNRMAAIRR LLHAAGLLPS TDSDALKRPG HDPWELRARG LDKPLKPVEF AVVLGHIAKR RGFKSAAKRK ATNISSDDKK MLTALEATRE RLGRYRTVGE MFARDPDFAS RRRNREGKYD RTTARDDLEH EVHALFAAQR RLGQGFASPE LEEAFTASAF HQRPMQDSER LVGFCPFERT EKRAAKLTPS FERFRLLARL LNLRITTPDG ERPLTVDEIA LVTRDLGKTA KLSIKRVRTL IGLEDNQRFT TIRPEDEDRD IVARTGGAMT GTATLRKALG EALWTDMQER PEQLDAIVQV LSFFEANETI TEKLREIGLT LAVLDVLLTA LDAGVFAKFK GAAHISTKAA RNLLPHLEQG RRYDEACTMA GYDHAASRLS HHGQIVAKTQ FNALVTEIGE SIANPIARKA LIEGLKQIWA MRNHWGLPGS IHVELARDVG NSIEKRREIE KHIEKNTALR ARERREVHDL LDLEDVNGDT LLRYRLWKEQ GGKCLYTGKA IHIRQIAATD NSVQVDHILP WSRFGDDSFN NKTLCLASAN QQKKRSTPYE WLSGQTGDAW NAFVQRIETN KELRGFKKRN YLLKNAKEAE EKFRSRNLND TRYAARLFAE AVKLLYAFGE RQEKGGNRRV FTRPGALTAA LRQAWGVESL KKQDGKRIND DRHHALDALT VAAVDEAEIQ RLTKSFHEWE QQGLGRPLRR VEPPWESFRA DVEATYPEVF VARPERRRAR GEGHAATIRQ VKERECTPIV FERKAVSSLK EADLERIKDG ERNEAIVEAI RSWIATGRPA DAPPRSPRGD IITKIRLATT IKAAVPVRGG TAGRGEMVRA DVFSKPNRRG KDEWYLVPVY PHQIMNRKAW PKPPMRSIVA NKDEDEWTEV GPEHQFRFSL YPRSNIEIIR PSGEVIEGYF VGLHRNTGAL IPTPVGPDSY VIA
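The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. As a minimal sketch, the helpers below strip that whitespace and compute GC content as a percentage, which can then be compared with the GC content field of the record. The names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
fragment = "ATGATCGACG AAAGCCTGAC"
print(clean_sequence(fragment))  # ATGATCGACGAAAGCCTGAC
print(gc_percent(fragment))      # 50.0
```

Passing the full gene sequence to `gc_percent` gives a value to compare against the reported 59%, assuming the database counts only G and C over the full gene length.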