Gene Sare_4163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4163 
Symbol 
ID5707712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4727708 
End bp4729156 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content73% 
IMG OID641273590 
ProductAraC family transcriptional regulator 
Protein accessionYP_001538943 
Protein GI159039690 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[COG2169] Adenosine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.813535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00229992 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGACCTGG ACTTCGAGCG GTGTTATCGG GCCGTCGACA GCCGTGACCA GCGGTTTGAC 
GGCTGGTTCT ACACGGGCGT GACCTCCACC GGCATCTACT GTCGGCCGTC TTGTCCGGCG
ATCACTCCGA AACGGGAGAA CATCCGGTTC TTTCCGTCGG CCGCCGCAGC GCAGGAGGCC
GGGCTTCGGG CCTGCCGTCG GTGCCGGCCG GATGCGACCC CGGGCTCACC GCACTGGGAC
GTCCGCGCCG ACGTGGTCGG TCGCGCCATG CGACTGATCG CCGACGGCGT GGTCGACCGG
TCCGGGGTAC CCGGCCTGGC GGCACAGCTC GGCTACACAG AGCGGCACCT GCACCGGATG
CTCCGCACCG AACTGGGGGC CGGCCCGCTC GCGCTGGCCC GCGCGCAGCG CGCGCAGACC
GCGCGGACCC TGATCGAAAC CACCGACCTC GGAATGGCGG AGATCGCGTT CGCCGCCGGG
TTCGGCAGCG TTCGGCAGTT CAACGACACG GTCCGCGAGG TGTACGCGGT TGCCCCGTCC
GAGCTTCGAG CGGTCCGGAG CCGACGGACG TCCCCCGCCG GACCCGGAAC GATCACCGTA
CGGCTGGCGT ATCGGCCCCC ACTGCATGTC GGGGCGCTGC TGGACTTCCT CGCCCCGCGG
GCGCTGCCCG GTGTCGACGA GGTGCGCGCG GGGGCCTATC ACCGCGGCCT GCGGCTGCCA
CACGGCACCG GCGAGGCTTC GCTGACTCCG ACGGACAGGC ACGTGGAGGC GACCCTACGC
CTGTCCGACC TGCGGGACCT GGCGCCGGCG GTGGCCCGCT GCCGCCGGCT GCTCGACCTC
GACGCCGACC CGACGGCAGT GGACGCCGTC CTGGCCACCG ACCCCGCCCT GGCCGCCGTG
GTCACGGCGG AGCCCGGAGT CCGGGTGCCG CGCGCGGTCG ACGGCTTCGA GGTGGCCGTC
CGCGCGGTCA TCGGCCAGCA GGTCTCGGTG GCGTCCGCCC GCACCACCCT CACCCGTCTC
CTGAACGAGC TACCCACCCT GGCCGACAGG TCGGATGGGG TGACCGGTGG ATTGCACGCG
TTTCCCTCCG CCGAGGAGGT GCGCAACGCG CCGGACTCAG CATTCCGGAT GCCGGCCGCC
CGCCGGGAGA CGCTGCGCCG GCTCGCGGGG GCGGTTGCCG CCGGGGAGCT CGACCTGGAA
CCGGGTGGGG ATCGGAAGGA GACCCGGCAA CGGCTGCTGG CGCTGTCGGG CATCGGCGCG
TGGACGGCGG ACTACATCAC GCTCCGCGCG TTGGGCGACC CGGACGTGTT CCTTCCCACC
GACGTTGCCG TCCGCCGGGG CGCTGCCGCC CTCGGTCTAC CTAGCACCCC GGACACCCTG
CACACGTACG CCGACCGCTG GCGCCCCTGG CGCTCATACG CGGTGAGCCG ACTTTGGAGA
GCAGCATGA
 
Protein sequence
MDLDFERCYR AVDSRDQRFD GWFYTGVTST GIYCRPSCPA ITPKRENIRF FPSAAAAQEA 
GLRACRRCRP DATPGSPHWD VRADVVGRAM RLIADGVVDR SGVPGLAAQL GYTERHLHRM
LRTELGAGPL ALARAQRAQT ARTLIETTDL GMAEIAFAAG FGSVRQFNDT VREVYAVAPS
ELRAVRSRRT SPAGPGTITV RLAYRPPLHV GALLDFLAPR ALPGVDEVRA GAYHRGLRLP
HGTGEASLTP TDRHVEATLR LSDLRDLAPA VARCRRLLDL DADPTAVDAV LATDPALAAV
VTAEPGVRVP RAVDGFEVAV RAVIGQQVSV ASARTTLTRL LNELPTLADR SDGVTGGLHA
FPSAEEVRNA PDSAFRMPAA RRETLRRLAG AVAAGELDLE PGGDRKETRQ RLLALSGIGA
WTADYITLRA LGDPDVFLPT DVAVRRGAAA LGLPSTPDTL HTYADRWRPW RSYAVSRLWR
AA