Gene Sare_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1703 
Symbol 
ID5704014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1968167 
End bp1970560 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content71% 
IMG OID641271206 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001536581 
Protein GI159037328 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.247477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00353536 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCCTCT CTGTTCATCA GCGGATCGCC GAGGAACTCG GCGTCGCCGA GCGCCAGGTA 
CGCGCAGCCG TGGAACTACT CGACGGCGGC GCGACCGTGC CGTTCATCGC CCGCTACCGC
AAGGAGGCCA CCGGCCTGCT CGACGACACC CAGCTGCGCA CCCTCGAGGA GCGGATGCGC
TACCTGCGCG AGTTGGACCA GCGCCGGACT GCGGTCCTGG AGTCGATCCG GGGCCAGGGC
AAGCTCGACG AGACCCTGAC GGCACAGATC ATGGCAGCCG ACTCGAAGTC TCGGCTGGAG
GACATCTATC TGCCGTACAA GCCGAAGCGG CGGACCCGGG CACAGATCGC GCGCGAGGCT
GGACTGGAGC CACTCGCCGA CACACTGCTC GACGATCCCG CCCAAGACCC ACGCGCGACG
GCCGTCAGGT TCGTCGACCC GGACCGGGGC ATCGCCGACC CGTCCGCCGC ACTGGACGGT
GCCCGCGCCA TCCTCGTCGA ACGGTTCGCC GAGGACGCCG ACCTGATCGG CACGCTACGC
GAGCAGATGT GGTCACGGGG CCGGCTGGTG TCCCGGGTAC GCGATGGTCA GGCCACGGCC
GGCGCCAAGT TCGCCGACTA CTTCGACTTC GCCGAGCCGT ACCCGAAACT GCCCTCGCAC
CGGGTCCTCG CCGTGTTCCG GGGGGAGAAG GAGGGTGTGC TCGACCTGAC CATGGAGCCG
GAGCAGCAGG AGAACCCGGA TCCAGCGACC ACCGGTCCGA CCCGGTACGA GGCGGCCGTC
GCCGCCCGGT TCGGGGTCAG TGACCGGGGA CGGCCGGCCG ACCGGTGGCT CTCCGACACG
GTGCGCTGGG CCTGGCGTAC CCGAATCCTG ATCCACCTCG GCGCGGACCT TCGCATGCGG
TTGTGGCAGG CCGCCGAGCA GGAAGCGGTG CGGGTCTTCG CCACGAACCT GCGGGACCTG
CTGCTGGCCG CCCCGGCCGG GGCCCGGACG ACGATGGGCC TGGATCCCGG CCTGCGCACC
GGGGTGAAGG TCGCCGTCGT TGACGCGACG GGCAAGGTGG TCGCCACCGA CACCATCTAC
CCGCACGAGC CGCGCCGGCA GTGGGACGCC TCGATCGAGA CCCTCGCCCG TCTCGCCACC
GCGCACCAGG TCGAGTTGGT CGCGATTGGT AACGGCACCG CGAGCCGGGA GACCGACCGA
CTCGCCGCAG AGCTGATCCG GCGCCACCCA CAGCTGAACC TCACCAAGCT CGTCGTGTCC
GAAGCCGGCG CTTCGGTCTA CTCAGCGTCC GCGTACGCCG CGCAGGAGCT GCCGGGCCTG
GACGTGTCGC TGCGGGGGGC GGTCTCCATC GCCCGTCGCC TCCAGGACCC ACTCGCCGAA
CTGGTCAAGA TCGATCCCCG GTCCATCGGA GTCGGGCAGT ACCAACACGA CCTGTCCGAG
GTGACGTTGT CCCGGTCGCT CGACGCGGTG GTCGAGGACT GCGTCAACGC GGTCGGCGTC
GACGTCAACA CCGCCTCCGC GCCACTGCTG ACCCGGGTCT CCGGCATCGG TGCCGGACTG
GCGGAGAACA TCGTGCTGCA CCGGGACGCC AACGGGCCCT TCCGAACCCG GGGCGACCTG
CGACGGGTAC CCCGGCTTGG TCCGAAGGCA TTCGAGCAGT GCGCGGGCTT CCTGCGCATC
CCCGACGGTG CCGACCCGCT GGACTCGTCG AGCGTGCACC CGGAGGCGTA CCCGGTGGTG
CGGCGGATCC TCGCCGCCAC GAAGCAGGAA CTGCGGATGG TGATCGGCCG CAGCGCGGTC
CTGCGCGGGC TGCGGGCCGC CGATTTCGTC GACGAGACCT TCGGGCTGCC GACGGTCACC
GACATCCTCG CCGAGTTGGA GAAACCCGGC CGGGATCCGC GGCCGGAATT CCGCACCGCC
ACGTTCACCG AGGGCGTGGA GACGATCACC GACCTGGTGC CCGGGCTGAT CCTCGAGGGC
GTGGTCACCA ACGTCGCCGC CTTCGGCGCG TTCGTGGACG TCGGCGTGCA TCAGGATGGC
CTGGTACATG TCTCGGCGAT GTCCCGCGCC TTCGTTCGCG ACCCTCGCGA GGTGGTGAAG
TCCGGTGACG TGGTGAAGGT CAAGGTCCTC GACGTGGACG TGCCACGCAA GCGCATCTCG
CTGACCCTTC GACTGAACGA TACCGAGGCC GGTCGCGGCG GAGCGCATGG CCAGCGGGAC
CGCGGTGGCG ACCGGGAGGC CAACCGCGGC GAGTCCCGAG GCCGCGGTGG GCAGCAGGCC
CGCGGTGGGC AGCAGGCCCG CGGTGGGCAG CCGCAACCCA GACGTGGCGG CGCCACGCCG
CCCCCGGCCA ACGACGCGAT GGCCGATGCC CTGCGTCGCG CCGGCCTCGC CTGA
 
Protein sequence
MTLSVHQRIA EELGVAERQV RAAVELLDGG ATVPFIARYR KEATGLLDDT QLRTLEERMR 
YLRELDQRRT AVLESIRGQG KLDETLTAQI MAADSKSRLE DIYLPYKPKR RTRAQIAREA
GLEPLADTLL DDPAQDPRAT AVRFVDPDRG IADPSAALDG ARAILVERFA EDADLIGTLR
EQMWSRGRLV SRVRDGQATA GAKFADYFDF AEPYPKLPSH RVLAVFRGEK EGVLDLTMEP
EQQENPDPAT TGPTRYEAAV AARFGVSDRG RPADRWLSDT VRWAWRTRIL IHLGADLRMR
LWQAAEQEAV RVFATNLRDL LLAAPAGART TMGLDPGLRT GVKVAVVDAT GKVVATDTIY
PHEPRRQWDA SIETLARLAT AHQVELVAIG NGTASRETDR LAAELIRRHP QLNLTKLVVS
EAGASVYSAS AYAAQELPGL DVSLRGAVSI ARRLQDPLAE LVKIDPRSIG VGQYQHDLSE
VTLSRSLDAV VEDCVNAVGV DVNTASAPLL TRVSGIGAGL AENIVLHRDA NGPFRTRGDL
RRVPRLGPKA FEQCAGFLRI PDGADPLDSS SVHPEAYPVV RRILAATKQE LRMVIGRSAV
LRGLRAADFV DETFGLPTVT DILAELEKPG RDPRPEFRTA TFTEGVETIT DLVPGLILEG
VVTNVAAFGA FVDVGVHQDG LVHVSAMSRA FVRDPREVVK SGDVVKVKVL DVDVPRKRIS
LTLRLNDTEA GRGGAHGQRD RGGDREANRG ESRGRGGQQA RGGQQARGGQ PQPRRGGATP
PPANDAMADA LRRAGLA