Gene Gdia_0427 details


Gene Information

Locus tag: Gdia_0427
Symbol:
ID: 6973821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 466452
End bp: 468092
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 65%
IMG OID: 643389959
Product: CRISPR-associated protein, Cse1 family
Protein accession: YP_002274838
Protein GI: 209542609
COG category:
COG ID:
TIGRFAM ID: [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1
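
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (neither is stated in the record). The function names are hypothetical and used only for illustration.

```python
# Values copied from the record above.
start_bp = 466452
end_bp = 468092
reported_gene_length = 1641      # bp
reported_protein_length = 546    # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS, assuming the stop codon is
    part of the gene length and is not counted as a residue."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the reported values.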


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00538764
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.144998
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTTC TGACCGCTTC ATGGCTGCCC ATTCGCCGGA AATCCGGCGC GGCGGAGACC 
ATTCGTCCGG CGCAGATCGT GGACCGCGTG GCCGACGATC CGATCATGGC GCTCGATTGG
CCACGGGCGG ATTTCCGCAT CGCGTCCCTT GAATTCCTGA TCGGATTGCT GGCAACGGCG
TTTCCGCCCA AAAACGAGGA TATCTGGTGC GAAACCTGGG AAGACCCACC TTCCGTGGAG
GCGCTGGACG AAGCCTTCGC CCCTGTGGCC GAAGCCTTCT GGCTCGATGG GCCGGGGCCA
CGCTTCCTGC AGGACCTGGA AAATCTCCAG AGCGGCCAGG AGCCTGTGGA ACGTCTGCTG
ATCGATGCGC CTGGCGATAG CACCGTAAAG AAGAACACGG ATCTGTTCGT GCATCGGCAG
CGGATCATGG CGCTGGGGCG TCCGGCCGCC TGCATGGCCC TGTTTACCCT CCAGTCCTGG
GCGCCGAGCG GCGGTGCGGG AAACATGACC GGCCTGCGGG GCGGCGGCCC CCTGGTCACG
CTGGTCCTGC CCCGGGAGGG AGCGAGCCTG TGGGAGATGG TATGGGCGAA TACGCCCTTC
GGGGTGCCTC CGTCCGAAGC AGACCTGCCC CGTGTGTTCC CGTGGCTCGC GCCGACAATC
GGGTCAGGCA AGGATGGAAC CAGTGTACGC CCAGGCCATA ACGCCCATCC GCTGCAATGC
TGGTGGGGAA TGCCGCGGCG TATTCGCCTC GATTTCGAAG CGGCGGAGGA CGGCATATGC
GACCTGACGG GGCAGCCGGA TGCCGTTCTG GTACCGGGCT GGCGACAGCG TCCTTATGGC
GCAAGCTACG CCGACTGGAC AGGGATGCCC TATGGCGCGG GCGCATCCAT CCACCCGTTG
ACGCCCCGTT ATCGCCAGAA GAAAGACGCC GAATGGCTGA GCGTCCACCC GCAGCCGGGT
GGAATCGGAT ACCGCCATTG GGCCGGGATT GTCGTGAACA GCAGCGATAC GCACCGGCTT
CCGGCAAGCA CGGTGCTGTC ATGGCGTAAC GACCGGGCCC GGAACGTTGC CGCCTCGCTC
ACGCCTCGGC TTCTGGCGGC CGGTTATGAC ATGGACAATA TGAAGGCCCG CAGCTTCGTC
GAGAGCGAGA TGCCGCTGCC CGGCGTGGTC GATCCGGTGC GTCAGGAGGC GCTTGACGCA
CTGGCCCGAG CGTATGTCGA GGCCGCCGAC CAAGTGGCGG GTATCCTCCG ACAGTGTGTC
CGGGAGGCCC TGTTTGGCAA GGGGACCATC TCGCCAGATG CAACGCTGTT CTCCGGCCTG
CGCGAACGGT TCTGGGCGCA GACGGAGGGA ACGTTCTTCG ATCTTCTCCA TCAGGCGGTG
CTGCTGGGTG ACGGAGATGA CATCGATCTG CGGCGCATCT GGCTGCGCGC CCTTCGGCGG
GTGGCCCTGG ACCTGTTCGA CTCCGCCGTC ATGCTGACTC CGGACACGGG AACGACCGAG
GCGCAACGAT CCGCGCTGGC GCGCCGGCGG CTCGGGGCGG CTGTCGCGGG CGGTGGAAAG
GAAGGGCAGG CGCTCATGAT GGTTCTGAAA ATTCCCGTGC CCGTTACACG GAAAGCGACG
TCCAGGATGA AGCAGCCATG A
 
Protein sequence
MNLLTASWLP IRRKSGAAET IRPAQIVDRV ADDPIMALDW PRADFRIASL EFLIGLLATA 
FPPKNEDIWC ETWEDPPSVE ALDEAFAPVA EAFWLDGPGP RFLQDLENLQ SGQEPVERLL
IDAPGDSTVK KNTDLFVHRQ RIMALGRPAA CMALFTLQSW APSGGAGNMT GLRGGGPLVT
LVLPREGASL WEMVWANTPF GVPPSEADLP RVFPWLAPTI GSGKDGTSVR PGHNAHPLQC
WWGMPRRIRL DFEAAEDGIC DLTGQPDAVL VPGWRQRPYG ASYADWTGMP YGAGASIHPL
TPRYRQKKDA EWLSVHPQPG GIGYRHWAGI VVNSSDTHRL PASTVLSWRN DRARNVAASL
TPRLLAAGYD MDNMKARSFV ESEMPLPGVV DPVRQEALDA LARAYVEAAD QVAGILRQCV
REALFGKGTI SPDATLFSGL RERFWAQTEG TFFDLLHQAV LLGDGDDIDL RRIWLRALRR
VALDLFDSAV MLTPDTGTTE AQRSALARRR LGAAVAGGGK EGQALMMVLK IPVPVTRKAT
SRMKQP
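
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is illustrative only: it builds the codon table from the NCBI-style ordering of bases (T, C, A, G) and translates the first and last lines of the gene sequence shown above. Translation table 11 uses the same amino acid assignments as the standard code and differs in the set of permitted start codons; since this gene starts with ATG, the sketch does not handle alternative starts. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAATCTTC TGACCGCTTC ATGGCTGCCC ATTCGCCGGA AATCCGGCGC GGCGGAGACC"
last_line = "TCCAGGATGA AGCAGCCATG A"

print(translate(first_line))  # MNLLTASWLPIRRKSGAAET
print(translate(last_line))   # SRMKQP*
```

The two outputs match the start and end of the protein sequence above, with the trailing `*` standing for the TGA stop codon. The last line can be translated on its own because every preceding line holds 60 bases, so it begins in frame.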