Gene GSU1387 details

Gene Information

Locus tag: GSU1387
Symbol:
ID: 2687896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1516947
End bp: 1518041
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 55%
IMG OID: 637126062
Product: CRISPR-associated Cse4 family protein
Protein accession: NP_952440
Protein GI: 39996489
COG category:
COG ID:
TIGRFAM ID: [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4
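
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1516947
end_bp = 1518041

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1095 0 364
```

The results match the Gene Length (1095 bp) and Protein Length (364 aa) fields listed in the record.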


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAACT TCATCAACTT TCACATCCTG ATTTCCCACA GTCCCTCCTG CCTTAACCGC 
GACGATATGA ACATGCAGAA ATCTGCTGTT TTCGGTGGTG AGCGGCGGGT GCGCGTTTCC
AGTCAAAGCC TCAAACGGGC CATCCGAAAG AGTGATTACT ATCGTCAGCA CCTTGGCGAA
GCGAGTGTGC GCACCAAGAA GTTGGACGAA CTGATCGCGA TCATAAATGA TCGTCTGGCC
GGACGCTACG ATACCGACCT CCTGAAGAAG ACTGTTGGGC TGCTGGCCGG CAAGGAGTTA
AGTGTCGAGG TTGCGACAGA AGGTGATGCC GTGGCGCCGT GGGCAATCGA AGAGGTGGCA
TGGTTCTGTG AGCAGGTCAA GAGGATGGTG GCGCAAGGAC AGGACGAAAA AGCTCTGGGC
AAATTGTTGA AGAATGAAAC GGCTGCCATG CGGCAGGCTC TGGCATCCGG TGTTGATATT
GCACTTTCCG GCCGCATGGC GACGTCAGGT CTCATGAGTG AACTCGGCAA GGTCGATGGT
GCCTTGGCCG TTGCCCATGT CTTGACCACC CACAGCGTTG ATGCGGATAT CGACTGGTTC
ACTGCCGTGG ATGACTTGCA GGAACTGGGC TCCGGTCATC TCGATACGCA GGAATTTTCC
AGCGGGGTCT TTTATCGTTA TGCCAGCCTC AACGTGAAGC AGTTGCAGGA AAACCTGGGC
AATGCCCCGC GCGAGAAGGC ACTGGAGATC GGCGCTCATC TGCTTCACAT GCTGGCAACG
ATTGTCCCTT CAGCCAAACA GCAGAGCTTC GCAGCTCACA ACCTGGCCGA CCTTGCCCTG
GTTTCCTTTT CCGATATCCC GGTATCGCTC GCAAATGCAT TCGAAAAACC TGTCCGTAGC
GTCAATGGCA GCGGCTTCAA AGAGCCTTCC ATTGCTGAAC TGCATAACTA CTGGCAGCAG
ATCCATACAG GTTACGGCCT TTCCGAGCGG TGCGGCGAGT TCATCCTCGG TCAGAGTAGC
GTCCCTGAAG GGATCACCCG GAAGAGTACT ATTGAAGAAC TCAAAACCTG GGTGATGAAC
AACGGAGAGG GGTAA
 
Protein sequence
MKNFINFHIL ISHSPSCLNR DDMNMQKSAV FGGERRVRVS SQSLKRAIRK SDYYRQHLGE 
ASVRTKKLDE LIAIINDRLA GRYDTDLLKK TVGLLAGKEL SVEVATEGDA VAPWAIEEVA
WFCEQVKRMV AQGQDEKALG KLLKNETAAM RQALASGVDI ALSGRMATSG LMSELGKVDG
ALAVAHVLTT HSVDADIDWF TAVDDLQELG SGHLDTQEFS SGVFYRYASL NVKQLQENLG
NAPREKALEI GAHLLHMLAT IVPSAKQQSF AAHNLADLAL VSFSDIPVSL ANAFEKPVRS
VNGSGFKEPS IAELHNYWQQ IHTGYGLSER CGEFILGQSS VPEGITRKST IEELKTWVMN
NGEG
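
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not a reference implementation, of how one might strip that layout, compute a GC fraction, and translate the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model).

```python
# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; unknown codons become 'X'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# First and last rows of the gene sequence as displayed in the record.
first_row = "ATGAAGAACT TCATCAACTT TCACATCCTG ATTTCCCACA GTCCCTCCTG CCTTAACCGC"
last_row = "AACGGAGAGG GGTAA"

print(translate(clean(first_row)))  # MKNFINFHILISHSPSCLNR
print(translate(clean(last_row)))   # NGEG*
```

The two translated fragments agree with the start and end of the protein sequence shown above, with the trailing `*` corresponding to the terminal TAA stop codon. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared against the GC content field in the record.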