Gene Sare_1208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1208 
Symbol 
ID5703992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1357773 
End bp1358864 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content72% 
IMG OID641270726 
Productintegrase family protein 
Protein accessionYP_001536107 
Protein GI159036854 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCGGG GTGGGGCGAC CCGTGCCCTG CACGCGGCAC TTCCCCCGCC GCTGCGGGAG 
GCGGTTGACG ACTTCGCGAA CCACCTGTCC CAGGTCCACA ATCGGTCAGC CCACACCGTC
CGGGCCTACG TCACGGATGT GGTTCACCTG CTCGACCACG CCGTACGGGC CGGGATCCGT
ACCCCCTCCG ACCTCACGCT GGCACAGGTA CGCAGTTGGC TGGCCCGACA GCGAACGACG
GGAGCGGCCA GGTCCACCCT CGCCCGACGG GCCGCCGCAG CCCGTACCTT CAGTGCCTGG
GCGCACCGGT GCGGCTGGAT ACCGACAGAT GTGGCCGCAC CACTGGCAAG TCCGCGAGCC
CAGCGGGAGC TACCCGCCGT ACTTCCGGTC CACCAGGCCG CCGCCCTGCT GGAGACCGCG
CACCACGCGG GACGCGGTCG GTCGAGGCAG AAGCAACCAC CGACGTCCGA TGCGCGACCA
GCCAGCGCGG CGGATACGAT GCCCGGCTCC CACAGCAGCC GCCACCAGAC TGGCGAGAAC
CGCACCGGCG GAGCCGGGCA ACACGGCGTC CCGTCCGACG CCAACGACCC GGTTCAGCTA
CGGGACTTGC TGCTACTGGA ACTCCTGTAC GCGACGGGGG TCCGGGTCAG TGAGGCGTGC
GGGCTGGACA TCGCGGACGT GGACCCGGGC CGGCGGGTGC TGCGGGTACT CGGCAAGGGA
AACCGGGAAC GCACCGTGCC GTACGGTGTC CCGGCGCAGC GAGCACTCGA CGCGTGGCTG
CGCCACGGCC GTCCCTGGCT GGCCGGGCCC CGGTCGGCGA ACGCGCTGCT GCTCGGGGCC
CGAGGAGGTC GACTCAACCC GACCACTGCG CGGGGAGTCG TCGCCCGCTG CGCGGCAGCC
GCCGGCCTGC CCCCGACCAC CCCGCACGGG CTACGGCACG CGACAGCCAC CCATCTGTTG
GAAGGTGGCG CGGACCTGCG GACGGTACAG GAGCTGCTCG GGCACACATC GCTGGCCAGT
ACCCAGATCT ACACCCACGT GTCGGTCGAG CGGCTGCGGG CCGCGTACCG ACAGGCCCAC
CCGCGCGCGT GA
 
Protein sequence
MNRGGATRAL HAALPPPLRE AVDDFANHLS QVHNRSAHTV RAYVTDVVHL LDHAVRAGIR 
TPSDLTLAQV RSWLARQRTT GAARSTLARR AAAARTFSAW AHRCGWIPTD VAAPLASPRA
QRELPAVLPV HQAAALLETA HHAGRGRSRQ KQPPTSDARP ASAADTMPGS HSSRHQTGEN
RTGGAGQHGV PSDANDPVQL RDLLLLELLY ATGVRVSEAC GLDIADVDPG RRVLRVLGKG
NRERTVPYGV PAQRALDAWL RHGRPWLAGP RSANALLLGA RGGRLNPTTA RGVVARCAAA
AGLPPTTPHG LRHATATHLL EGGADLRTVQ ELLGHTSLAS TQIYTHVSVE RLRAAYRQAH
PRA