Gene Csal_3168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3168 
Symbol 
ID4028635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3532849 
End bp3533994 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content60% 
IMG OID637968382 
Productputative transposase, IS891/IS1136/IS1341 
Protein accessionYP_575211 
Protein GI92115283 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAAC GCGCCTACCA ATACCGCTTC TACCCGACGC CTGAGCAAGC TCAGTTGCTT 
GCCCGGACGT TCGGTTGTGT ACGTTTCGTC TATAACGCGG TGTTGCGTTA TCGCACGGAT
GCCTTCTACC AGCGCCAAGA GCGGATAGGT TACGTTGAGG CCAACGCTTA CCTGACGCGG
ATGAAGAAAG CCGACGAAAC TGCGTTTCTC AACGAAGTTA GTTCGGTTCC GCTTCAGCAG
TGTCTTCGCC ACCAACAGGC GGCCTTCAAG CACTTCTTTG AAGGCCGTGC CCGTTACCCG
GCCTTCAAGA ACAAGCGGCA CCGCCAGGCG GCGGAGTTCA CGCGCTCGGC CTTCAAGTAT
CGAAATGGTC AGCTGTTCCT GGCCAAGTGC AAGGAGCCGC TCGCTATCCG CTGGAGCCGT
GAGCTGCCCA GCGAGCCGAC GACCATCACG ATTTCCAAGG ACTCGGCAGG CCGCTACTTT
GTCAGTTGCC TGTGTGAATT CACTCCCGAG ACGCTGCCCG TCACCCCTCG GATGACCGGT
ATCGATCTGG GCCTGAAAGA CCTGTTTATC ACCGATCAGG GAGAACGAAT CGGCAATCCC
CGCCATACCG CTAAATACGC GGCTCGTCTG GCCAAGGCAC AGCGCCGGTT GAGCAAGAAG
AAGCTCGGCT CCGCCAACCG CGCCAAGGCC AGAAAGCGAG TGGCGCGACT TCACGCGAAG
ATCTCCGATT GCCGAATGGA CCGCTTGCAC AAGCTGTCTC GCAGACTGAT TAACGAGAAC
CAAGTGGTCT GCGTCGAATC CTTGAAAGTG AAGAACATGC TCCGCAACCC GAGCCTTGCC
AAGGCCATTG CCGATGCTGG CTGGGGCGAG TTCGCCCGCC AGCTTGAATA CAAGGCACAG
TGGGCCGGAC GCCAACTGGT TAGGATCGAC CCGTGGTATC CCAGCTCCAA GCGCTGCTCG
GATTGCGGGC ACATCAAAGA CGCGCTACCG CTGAGCGTTC GTGCCTGGGA CTGCCCCGCC
TGCGGGGTTA CCCACGACCG TGACATCAAC GCCGCGCGTA ATATCAAAGC CGCCGGGCTG
GCGGTGTTAG CCCTTGGAGA GAATGTAAGC GGCATGGAGC CCGTCTCCGT GTCCGGTTCT
CGGTGA
 
Protein sequence
MTKRAYQYRF YPTPEQAQLL ARTFGCVRFV YNAVLRYRTD AFYQRQERIG YVEANAYLTR 
MKKADETAFL NEVSSVPLQQ CLRHQQAAFK HFFEGRARYP AFKNKRHRQA AEFTRSAFKY
RNGQLFLAKC KEPLAIRWSR ELPSEPTTIT ISKDSAGRYF VSCLCEFTPE TLPVTPRMTG
IDLGLKDLFI TDQGERIGNP RHTAKYAARL AKAQRRLSKK KLGSANRAKA RKRVARLHAK
ISDCRMDRLH KLSRRLINEN QVVCVESLKV KNMLRNPSLA KAIADAGWGE FARQLEYKAQ
WAGRQLVRID PWYPSSKRCS DCGHIKDALP LSVRAWDCPA CGVTHDRDIN AARNIKAAGL
AVLALGENVS GMEPVSVSGS R