Gene Sfum_1355 details

Gene Information

Locus tag: Sfum_1355
Symbol:
ID: 4460525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 1682181
End bp: 1683233
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 62%
IMG OID: 639702123
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_845481
Protein GI: 116748794
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
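
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 1682181
END_BP = 1683233


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # 1053 bp
print(protein_length(length_bp), "aa")  # 350 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.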


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.403808
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAAAGAA CGTACATCCT GGAGCAGGGA GCGTATTTGC GCAAAGCGGG CAATCACCTG 
GTCGTGACCA AGAACCGGGA GATCATAGCG GAGATCCCGC TGGAAGGCCT CAGCCAGCTC
ACCCTGGTGG GCTTTTCCTC CCTCAGCGGA GCGGTCCTGG AAGTGCTCAT CCGCCACCGC
ATCGAAACGG TGTTGCTCAG CCCCAGGGGA CAGTTTCGCG CCAGGCTCAT GGTGGATGAA
CACAAGCACG TCCAACGACG GCAGGGTCAG TACGTCAAGC TTTCCGGGGC CGATTTCGCA
CTGAGGACCA CTCAGAGCAT CGTCCGGGGA AAGCTGCGAA ACACGGCCCG CTTTCTGGCA
CTGCGAGGAA GCAGGTACGG GAGCGAGGCG CTCCACCGGG CGGCGGCACA GATCAAGGGA
CTGTCGGCTC TCGTCGATCG ACAGAAAGAC ATGGACCTGC TGCGCGGGAT CGAGGGGCAT
GCGGCGAACC TGTACTTCGA AGTGTTCCCG CTCCTCGTCC GGGTCCCGGG TTTTGAATTC
AACGGCCGCA ACCGGCGTCC GCCCCTCGAC CCGCTCAATG CGCTTCTCTC GTTTGTCTAC
ACCCTGCTCA CGCAGGAGGT CCTGACGGCC ATCAAGGTCG TGGGGTTGGA CCCTTACCTC
GGCTGCCTTC ACGCGGTCGA CTACGGCAGG CCCTCGCTGG CCTGCGACCT GGTGGAGGAA
TGGCGCACTT TCCTGGGCGA CCGGCTCGTG CTGGCGCTCG TCAACCGTCG CGTCATCGGC
CTCGACGATT TCGTCTACCG TCCCACCCCG TGCGCGGACG CAGTAGACGA AGAGGAGCTG
AAGCATCGCC GGCCGGTGGA GATGAAACCG AAGATCGCCC GGGCATTCAT CGAAGCTTAT
GAGAAGTGGA TGGCAAGCCG TATTCTGGAC CCGGGTTCGA GGGAAAGGAC GGACTATCGC
GGGCTCATTC AGCGCCAGGT CTGGAAATTC TGTCATTATC TCGTGGGGGA CCGCGACTCT
TATGAGCCGT TCATCTGGTC GGAGGTCTCC TGA
 
Protein sequence
MERTYILEQG AYLRKAGNHL VVTKNREIIA EIPLEGLSQL TLVGFSSLSG AVLEVLIRHR 
IETVLLSPRG QFRARLMVDE HKHVQRRQGQ YVKLSGADFA LRTTQSIVRG KLRNTARFLA
LRGSRYGSEA LHRAAAQIKG LSALVDRQKD MDLLRGIEGH AANLYFEVFP LLVRVPGFEF
NGRNRRPPLD PLNALLSFVY TLLTQEVLTA IKVVGLDPYL GCLHAVDYGR PSLACDLVEE
WRTFLGDRLV LALVNRRVIG LDDFVYRPTP CADAVDEEEL KHRRPVEMKP KIARAFIEAY
EKWMASRILD PGSRERTDYR GLIQRQVWKF CHYLVGDRDS YEPFIWSEVS
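
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the pipeline that produced the record: it assumes the amino-acid assignments of NCBI translation table 11 (which match the standard code for sense codons), assumes the sequence contains only A, C, G and T, and uses hypothetical helper names (`clean`, `translate`, `gc_percent`).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, with the display spacing left in.
first_bases = "ATGGAAAGAA CGTACATCCT GGAGCAGGGA"
print(translate(first_bases))  # MERTYILEQG
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence listed above, and `gc_percent` gives a value to compare with the GC content field.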