Gene GSU0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0053 
Symbol 
ID2688405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp69225 
End bp70316 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content54% 
IMG OID637124718 
Producthypothetical protein 
Protein accessionNP_951115 
Protein GI39995164 
COG category 
COG ID 
TIGRFAM ID[TIGR02570] CRISPR-associated protein, GSU0053 family, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGATC TCGTGCAGAA GTATGACCAT TGGTTGGAAA ACTCCGGACC TGCGGCACTG 
GTTATTCGCG AACAACTGAT GCCCGTCGAG GGACGTGACG GTGTGCTGTT TCCAGCGACC
TTTGCCGATA CCGGCTACAA CATCGACAAA TTCGACGATG GCGGCAATGT CTGCCTGATC
GACAGTGTCG GGTCCCAGGC AAACAGGATC GAGCCGATCT TCATGACTAA GGATTACGCT
GGCCTTGTCC CCCAAATAGT GGTCCAGGCG GGAAACAAAA AAGTAAATCT TCTCGAAGCA
GGGCATCGAG CCGGGGACGC GATTATTCGC TGTTCTGAGT TGCAGCAAAC CCTTAGGGCT
GCGTTCAACA ACGTTCTGAA TGGCAATGCA GAGCCACTAG CCCGTATAGC ACCCACCTCG
CTTGTGTTTG GCGTGTGGGA TTCACGAGAT ACCCAAGCCA AATTGCCCAG ACTCGTTGCC
TCGACCATAA GGGCCTACAA TGTTCGCCCT CTCACCCGCT CTGCCCAGTA TGTGCCGGCT
GTTGACTACA ACGCCGAAGG GCTTTTGGAA GAGCCCGGTG ACTTGCGAGA TGCTGAAGGC
AAAGTCAAGA GCAAGCACCC GTTTGCCCAA CGCGGGTTTG TGCATGTCCC GGCGACAGGT
GCTCTCGGCG GCGTAATCGC CACCGGGGGG ATTCGCCGTG ACGCCACACT CCACCTTGCC
GCGCTCCGCT TGCTTTCGGC AGGCCAAGAC GAAGCAAAGT CCAAGGCCCT TCGCCGCTAT
ATACTCAGTC TTGCCTTAAC AGCATTTACT GTGCCTGTAA CTGGCTATCT GCGTCAGGGC
TGCAATCTTG TGCTCGACCC TGAAAACCCC CTTGAGTTTA AAGAGGTTTT TAATGATGGG
ACGCGCAATG ACGTCGGTAT TACGCACACC GAAGCGATTG TCTATGCAAA GGCAGTTGCA
AAGGAGTTTG GCATTGACCC CGAGCGTAAC CTTGACGAAA AAAAAGCCCC GGATCGAGAA
GTACCGTTTG ACAAGGTACT GGCGAAAAAA GATGTGAGCG ATGCCGGAGG CTCTAAGAAA
AAAGCAAAAT GA
 
Protein sequence
MNDLVQKYDH WLENSGPAAL VIREQLMPVE GRDGVLFPAT FADTGYNIDK FDDGGNVCLI 
DSVGSQANRI EPIFMTKDYA GLVPQIVVQA GNKKVNLLEA GHRAGDAIIR CSELQQTLRA
AFNNVLNGNA EPLARIAPTS LVFGVWDSRD TQAKLPRLVA STIRAYNVRP LTRSAQYVPA
VDYNAEGLLE EPGDLRDAEG KVKSKHPFAQ RGFVHVPATG ALGGVIATGG IRRDATLHLA
ALRLLSAGQD EAKSKALRRY ILSLALTAFT VPVTGYLRQG CNLVLDPENP LEFKEVFNDG
TRNDVGITHT EAIVYAKAVA KEFGIDPERN LDEKKAPDRE VPFDKVLAKK DVSDAGGSKK
KAK