Gene Sare_1922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1922 
Symbol 
ID5708275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2218415 
End bp2219620 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content66% 
IMG OID641271427 
Productintegrase family protein 
Protein accessionYP_001536798 
Protein GI159037545 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.612381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00155149 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAGTC CTCAACACCC GCCAATAGGC GTACGGCTTG CGCCTGATGT GGAGTTCCGG 
CCTGGCCGAG AAAACTCCTA CCGGGCACGG GTCCGGTGGA TCGATCCGGC CACGAAGCGC
CGTCTGTCCA AGTCAACGAG TGTGGCCACG TCGGAGGAAG CGCAAGCCTG GATCGATGGG
CTCATGAGTG CCGCGCAAGG TGGCATCGAC CCGACCGCCG CCACCAAGCG GCTAACCGAG
TATGGCGAGA GTGTAATGAC GCTGGCCCTA CGCGGGCTGG AAGGCAAGAC GCTCGATCCG
TATCTGGCTG GGTGGCGGAA ACGGGTTGTT CCCACGCTCG GTCACATCCC GGTTCGCATG
ATTACCAATG GCGCGGTTGA CCGCGCTGTA CATAGCTGGA TTGCCGACGA ATGCAGCCGC
TCGACGGTGA AGAACAGCCT CGCCGTTCTG GTTCGCGTGA TGGAACAGGC GGTGCGGGAC
GGCATCATCG CTCGCAATCC CGCCCAGGTC ACGGGATGGC AGCGCGAATA CCAGCAAGCC
GAGGACGAAT TGGACGATCC CCGCTCGCTG GCGCTCTCCG ATTGGGAGGC GCTAACCGCA
CTCGCTGCCG CGTTGGTCGA ACGGTCGGCC AACGCCTTCA CCGGGTGGGC GGACGTGGTG
ATTTTCGCTG CCTGCACCGC CGCGCGAATA GGCGAGGTAT CGGGCGTTCG GGCCGAGGAC
ATCAACCGGG ATACGTGGAT GTGGACCGTG CGCCGGCAGA CCACGCCCGG CCCCGGTGGC
CTGATCGATA AGGGCACCAA GGGCAAGCGC GCCCGGATGG TTCCGCTGAT CGAGGAAGTG
CGGCCGCTCG TGACGCACCG CCTGGGGGTG GCGACCAAAC CCGACGCACG GCTGTTTACC
GGCCCGCGCG GTGGCCGTAT TTCCACGGCC GTTCTCCGCG ACGCGACTCA TTGGGATGAG
GTGGTGACGA AGCTCGGCTA CGAGCACCTA CGCCGACACG ACCTGCGGCA CACCGGGTTG
ACCTGGATGG CCGACGCTGG CGTGCCGGTG CACGTCCTGC GGAAAATCGC CGGACACGGG
TCGCTCACCA CGACCCAGCG ATACCTACAC CCCGACCGAC AGGCGATCAC GGACGCCGGC
ACGGCGCTCA GCGCCCACTT GAAGGCCCGC CGGTCCCCAG GTGGTCCCCA GCTACGCGCC
GTCTAG
 
Protein sequence
MASPQHPPIG VRLAPDVEFR PGRENSYRAR VRWIDPATKR RLSKSTSVAT SEEAQAWIDG 
LMSAAQGGID PTAATKRLTE YGESVMTLAL RGLEGKTLDP YLAGWRKRVV PTLGHIPVRM
ITNGAVDRAV HSWIADECSR STVKNSLAVL VRVMEQAVRD GIIARNPAQV TGWQREYQQA
EDELDDPRSL ALSDWEALTA LAAALVERSA NAFTGWADVV IFAACTAARI GEVSGVRAED
INRDTWMWTV RRQTTPGPGG LIDKGTKGKR ARMVPLIEEV RPLVTHRLGV ATKPDARLFT
GPRGGRISTA VLRDATHWDE VVTKLGYEHL RRHDLRHTGL TWMADAGVPV HVLRKIAGHG
SLTTTQRYLH PDRQAITDAG TALSAHLKAR RSPGGPQLRA V