Gene Gdia_0342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0342 
Symbol 
ID6973736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp382737 
End bp385748 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content59% 
IMG OID643389874 
ProductCRISPR-associated protein, Csn1 family 
Protein accessionYP_002274753 
Protein GI209542524 
COG category[S] Function unknown 
COG ID[COG3513] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01865] CRISPR-associated protein, Csn1 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0739339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0465454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACG AAAGCCTGAC ATTCGGGATC GACTTGGGCA TCGGTTCATG CGGCTGGGCC 
GTACTGCGGC GGCCGTCTGC CTTCGGAAGA AAAGGCGTGA TAGAAGGAAT GGGGAGTTGG
TGCTTCGATG TTCCCGAAAC CAGCAAGGAA CGGACGCCCA CCAACCAGAT TCGGCGTTCC
AACCGACTGC TACGGCGGGT AATCCGCCGC CGCCGCAACC GTATGGCCGC AATCAGGCGA
TTACTCCACG CCGCCGGGCT GCTTCCCTCG ACCGACAGCG ATGCGTTGAA ACGCCCTGGC
CACGATCCAT GGGAGTTGCG CGCGCGCGGC CTCGACAAAC CGTTGAAACC CGTCGAGTTC
GCGGTCGTGC TTGGCCATAT CGCCAAAAGG CGGGGCTTCA AATCCGCCGC CAAGCGCAAG
GCGACAAACA TCAGCAGCGA CGACAAGAAG ATGCTGACCG CACTGGAAGC CACTCGCGAG
CGGCTGGGGC GCTACCGCAC GGTCGGCGAA ATGTTTGCGC GTGATCCTGA TTTTGCCAGC
CGCCGCCGCA ACCGCGAAGG CAAATATGAC CGCACTACTG CCCGTGACGA CCTGGAGCAT
GAGGTCCACG CCCTGTTCGC CGCGCAGCGC CGGCTGGGAC AGGGTTTCGC CTCGCCAGAA
CTGGAAGAAG CGTTCACCGC CAGTGCCTTC CACCAGCGGC CGATGCAGGA CAGCGAAAGG
CTGGTAGGTT TCTGCCCATT CGAAAGGACG GAGAAGCGCG CAGCGAAATT GACTCCGTCC
TTTGAGCGAT TCCGCCTGCT GGCCCGGCTC CTCAACCTGC GCATTACGAC GCCCGATGGC
GAACGCCCGT TGACCGTCGA TGAAATCGCT CTCGTTACCC GCGATCTCGG CAAGACTGCA
AAGCTGTCGA TCAAGCGCGT GCGGACCCTG ATCGGTCTGG AGGACAATCA ACGTTTTACC
ACGATCAGGC CGGAGGATGA GGATCGCGAT ATCGTTGCCC GGACCGGTGG GGCGATGACG
GGAACCGCGA CCCTCCGCAA GGCTCTCGGC GAGGCGCTTT GGACCGACAT GCAAGAGAGG
CCGGAGCAGC TTGACGCGAT TGTCCAGGTA CTCAGTTTCT TCGAGGCGAA CGAAACAATA
ACGGAGAAAT TGCGTGAGAT CGGTCTGACG CTTGCGGTCC TCGACGTTTT ACTGACGGCG
CTGGATGCTG GCGTATTCGC GAAATTCAAG GGCGCTGCCC ATATTTCGAC GAAAGCGGCG
CGCAACCTGC TGCCCCATCT CGAACAAGGC CGACGCTATG ACGAAGCTTG CACGATGGCC
GGCTATGACC ATGCTGCCTC CCGCCTTTCC CATCACGGTC AGATCGTCGC AAAGACCCAG
TTCAACGCAC TGGTCACGGA AATCGGGGAA AGCATTGCCA ACCCTATCGC CCGCAAAGCC
CTGATCGAAG GGCTCAAGCA GATCTGGGCG ATGCGTAATC ACTGGGGCTT GCCCGGTTCG
ATCCATGTCG AACTTGCCCG CGATGTCGGC AACAGTATCG AAAAGCGACG GGAAATTGAA
AAACATATTG AAAAAAATAC CGCCCTGCGC GCGCGCGAAC GTCGGGAGGT CCATGATCTT
CTTGATCTGG AAGATGTGAA CGGCGACACA TTGCTGCGTT ACCGACTATG GAAAGAACAG
GGAGGCAAAT GCCTGTATAC CGGCAAGGCT ATCCACATTC GTCAGATAGC AGCCACCGAC
AACAGCGTCC AGGTGGATCA TATCCTGCCT TGGTCCCGCT TCGGCGATGA CAGCTTCAAC
AACAAGACGC TGTGTCTTGC CAGTGCCAAC CAGCAAAAAA AGCGATCGAC GCCCTATGAA
TGGCTCTCCG GACAGACTGG CGATGCGTGG AACGCTTTTG TACAGCGGAT CGAGACCAAC
AAGGAACTGC GCGGCTTCAA GAAGCGCAAT TATCTGCTGA AAAACGCCAA AGAGGCTGAA
GAGAAATTCC GCAGCCGCAA TCTCAATGAC ACGCGCTATG CCGCACGCCT GTTCGCGGAA
GCAGTGAAAC TGCTTTATGC TTTTGGCGAG CGGCAGGAAA AGGGCGGTAA TCGTCGCGTC
TTCACCCGGC CCGGCGCTCT TACGGCGGCT TTGCGTCAGG CATGGGGGGT GGAATCACTC
AAGAAACAGG ATGGCAAGCG CATCAATGAT GACCGCCATC ACGCGCTGGA TGCGCTGACA
GTGGCGGCAG TTGACGAAGC CGAGATCCAG CGGCTGACCA AATCTTTTCA CGAATGGGAA
CAGCAGGGAC TGGGCCGACC ACTGCGACGT GTCGAACCGC CGTGGGAAAG CTTTCGTGCG
GATGTTGAGG CAACCTATCC GGAAGTATTC GTTGCCCGCC CCGAACGCCG CCGCGCGCGT
GGCGAGGGCC ATGCCGCGAC AATCCGGCAG GTGAAAGAGC GCGAATGTAC GCCCATCGTC
TTTGAACGGA AGGCCGTTTC CAGCCTCAAG GAAGCCGACC TTGAACGGAT TAAGGATGGC
GAACGCAATG AGGCCATCGT GGAGGCTATA CGCTCCTGGA TCGCAACCGG CCGGCCCGCC
GACGCCCCAC CCCGTTCGCC ACGCGGCGAT ATTATCACCA AGATCCGTCT GGCGACCACG
ATCAAGGCCG CCGTTCCCGT CCGCGGGGGC ACTGCCGGTC GAGGAGAAAT GGTACGGGCG
GACGTGTTTA GCAAGCCCAA CCGCAGGGGC AAGGACGAAT GGTATCTGGT GCCCGTTTAT
CCACATCAGA TCATGAATCG GAAGGCTTGG CCAAAGCCAC CGATGCGGTC GATAGTTGCC
AATAAGGATG AGGATGAATG GACCGAAGTC GGTCCCGAGC ATCAATTTCG CTTTAGTCTT
TATCCTCGCT CCAATATAGA GATCATAAGA CCGAGTGGAG AAGTGATCGA AGGATATTTC
GTGGGCCTTC ATCGGAACAC GGGGGCACTA ATACCAACTC CGGTCGGCCC TGACTCATAT
GTGATTGCCT GA
 
Protein sequence
MIDESLTFGI DLGIGSCGWA VLRRPSAFGR KGVIEGMGSW CFDVPETSKE RTPTNQIRRS 
NRLLRRVIRR RRNRMAAIRR LLHAAGLLPS TDSDALKRPG HDPWELRARG LDKPLKPVEF
AVVLGHIAKR RGFKSAAKRK ATNISSDDKK MLTALEATRE RLGRYRTVGE MFARDPDFAS
RRRNREGKYD RTTARDDLEH EVHALFAAQR RLGQGFASPE LEEAFTASAF HQRPMQDSER
LVGFCPFERT EKRAAKLTPS FERFRLLARL LNLRITTPDG ERPLTVDEIA LVTRDLGKTA
KLSIKRVRTL IGLEDNQRFT TIRPEDEDRD IVARTGGAMT GTATLRKALG EALWTDMQER
PEQLDAIVQV LSFFEANETI TEKLREIGLT LAVLDVLLTA LDAGVFAKFK GAAHISTKAA
RNLLPHLEQG RRYDEACTMA GYDHAASRLS HHGQIVAKTQ FNALVTEIGE SIANPIARKA
LIEGLKQIWA MRNHWGLPGS IHVELARDVG NSIEKRREIE KHIEKNTALR ARERREVHDL
LDLEDVNGDT LLRYRLWKEQ GGKCLYTGKA IHIRQIAATD NSVQVDHILP WSRFGDDSFN
NKTLCLASAN QQKKRSTPYE WLSGQTGDAW NAFVQRIETN KELRGFKKRN YLLKNAKEAE
EKFRSRNLND TRYAARLFAE AVKLLYAFGE RQEKGGNRRV FTRPGALTAA LRQAWGVESL
KKQDGKRIND DRHHALDALT VAAVDEAEIQ RLTKSFHEWE QQGLGRPLRR VEPPWESFRA
DVEATYPEVF VARPERRRAR GEGHAATIRQ VKERECTPIV FERKAVSSLK EADLERIKDG
ERNEAIVEAI RSWIATGRPA DAPPRSPRGD IITKIRLATT IKAAVPVRGG TAGRGEMVRA
DVFSKPNRRG KDEWYLVPVY PHQIMNRKAW PKPPMRSIVA NKDEDEWTEV GPEHQFRFSL
YPRSNIEIIR PSGEVIEGYF VGLHRNTGAL IPTPVGPDSY VIA