Gene Sare_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1650 
Symbol 
ID5703551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1898599 
End bp1899897 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content70% 
IMG OID641271156 
Producttransposase IS4 family protein 
Protein accessionYP_001536531 
Protein GI159037278 
COG category[L] Replication, recombination and repair 
COG ID[COG5659] FOG: Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.156297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0326148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGCGT GCCACAGACT AGACCTTGAT CGGTGGCGGG TCATCCTGGA CGAGGCGGTG 
ACGCGTGTCG CTGACCGGTT CGTCCGGGCG GAGCCGCGTG CGACGGCGGG ACAGTTCGTG
GAGGGGCTGC TGTCGGGGGT GGAACGTAAG ACCTGCTGGT CCCTGGCGGA GCGGGCTGGT
CACGCCGACC CGCAGGCGAT GCAGCGGCTG CTGCGTACGG CGGTGTGGGA CGCCGACGCC
GTCCGCGACG ATGTGCGGAA TTGGCTGGTC GAGCAGCTCG GCCACCCCGA CGGTGTGCTG
GTCACCGACG AGACTGGCTT CCTCAAGAAG GGCGTGTGCT CGGTCGGGGT CCAGCGGCAG
TACACCGGCA CCGCCGGACG TGTGGAGAAC AGCCAGGTCG GGGTGTTCCT GGCCTACGTG
TCACCTGCCG GGCGGGCGTT GATCGACCGT CGGCTCTACC TGCCGGAGAC GACCTGGTGC
GACCAGCCCG ACCGGCTCGC TGCCGCCGGC GTCCCAGACG ACGTCAGGTT CGCCACGAAA
CCGGCCCTGG CCCGGCAGAT GATCGCCGCC GCGCTGGACG CCGGTGTGCC CGCCGGGTGG
GTGACTGGCG ACGAGGTTTA CGGCGCCGAC CCCGGCCTGC GCGACGACCT CGAAGACCGC
GGCATCGGCT ACGTCCTGGC CGTCGGCTGT GACCGACGGG TACACGTCAA CGACGGACGC
ACCCTCGTAC GGGTCGATCA CCTCGCCGAG CGGATTCCCA CCGCCGAGTG GCAGTTGCAC
AGTTGCGGGC CGGGGGCGAA AGGTCCCCGC GACTACCTGT GGGCCTGGAT CATCACCGCC
ACCCGACCCG GTGAGCACCA GTGGCTGCTT ATTCGCCGCA ACCGCAGCAC CGGCGAGCTG
GCCTTCTACC TGTGCTGGTC ACCTCGCCCG GTGCCGCTGC ACACCCTCGT GACCGTGGCC
GGCTCCCGCT GGAGCATCGA GGAGTTGTTC CAGACCGGCA AAGGCCAGGT CGGCCTGGAC
CACTACCAGG TCCGCGGCTG GACCGGCTGG CACCGCTTCC TCACCCTGGC CATGCTCGCC
CTGGCCGTCC TGACCATCCT CGCCGCCACC ACCGCCCAGC AGACCGACGC CGACCCGGAG
ATCATCGCGT TGACCGTCGC CGAGATCCGG CGACTCCTCA ACGCCCTCGT TCTGGCCCTG
CCCCTACCAG CAGCGCACAC CCTGCACTGG TCGATCTGGA GACGAACATC CCAAGCCCGA
GCCCGCCGAT CCCACTACCA GCGCAGACAG GCGAAGTGA
 
Protein sequence
MAACHRLDLD RWRVILDEAV TRVADRFVRA EPRATAGQFV EGLLSGVERK TCWSLAERAG 
HADPQAMQRL LRTAVWDADA VRDDVRNWLV EQLGHPDGVL VTDETGFLKK GVCSVGVQRQ
YTGTAGRVEN SQVGVFLAYV SPAGRALIDR RLYLPETTWC DQPDRLAAAG VPDDVRFATK
PALARQMIAA ALDAGVPAGW VTGDEVYGAD PGLRDDLEDR GIGYVLAVGC DRRVHVNDGR
TLVRVDHLAE RIPTAEWQLH SCGPGAKGPR DYLWAWIITA TRPGEHQWLL IRRNRSTGEL
AFYLCWSPRP VPLHTLVTVA GSRWSIEELF QTGKGQVGLD HYQVRGWTGW HRFLTLAMLA
LAVLTILAAT TAQQTDADPE IIALTVAEIR RLLNALVLAL PLPAAHTLHW SIWRRTSQAR
ARRSHYQRRQ AK