Gene Sala_2105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2105 
Symbol 
ID4080080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2211590 
End bp2212690 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content64% 
IMG OID638010481 
Producttransposase, IS4 
Protein accessionYP_617147 
Protein GI103487586 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGGGG ACGACATTGG GACGGAGAGG CTGTTTTCCT ATGTGAGTTG CGAGGCTCGG 
GTTTCTGCGA GCCATCCGCT TCGGCCGATC CGGGCGATTG TCGATGAAGT GCTGGAGGTG
CTGCCGGCCG ATTTTGAGGG GATGTACGCG AAGACGGGGC GTCCCTCGAT CGCGCCTGGG
AAGCTGCTGC GCGCGTTGCT GCTACAAGCC TTTTATTCGA TCCGATCGGA ACGCCAGTTG
ATGGAGCAGA TGGACTACAA TCTGCTGTTC CGCCGGTTCG TGGGTCTGTC GATGGATGCG
GCGGTTTGGG ACGCCTCGGT GTTCACCAAG AACCGTGATC GGCTTCTGGA AGGCGATGTG
GCGACCAGGT TCCTCGCCGC GGTCGTGGCG CAGGCCCGAG GCCGCGATCT CCTTTCAGAC
GAGCATTTCT CGGTGGACGG CACGCTGATC GACGCCTGGG CTTCGATGAA GAGCTTCCGC
CCCAGGGATG ATGGCGAGGG ACCGGCGGGG GCCGGGCGCA ATGCCGAACG CGACTTTCGC
GGCGAGAAGC GGTCGAACCA GACGCATGCC TCGACCACCG ATCCCGAAGC GAAGCTCTAT
CGCAAGGCCA ACGGTCAGTC GTCGCGCATG GCCTTCATGG GGCATGGGCT AATGGAGAAC
CGCAATGGCC TGGTGGTCGG CGCGCTCGTC ACTCAGGCCA CAGGCACCGC CGAACGTGAG
GCGGCACTGG TTTTGGTCGA TGAACTCAAA GCCACCGGCC GCATCACCCT GGGCGCGGAC
AAGGCTTACG ACGCACGCGC GTTCGTTCAG GCTCTGCGCG CCCGCAAGGT CACGCCGCAT
ATCGCTCGCA ACGAGCAGAT CAACCAGGCC GGTGAACGAC GACGCAGAAG CGCCATCGAC
GGTCGCACCA CCCGCCATCC CGGCTACGCC ATCAGCTTGG CGGTTCGCAA GCGGATCGAA
GAAGTGTTCG GTTGGGCCAG GACCGTCGGT GGCCCGCGTA AAACGCGCCA CAAGGGCACC
GATCGCGTCG GCCAGGCTTT CACCTTGACC GCCGCCACCT GCAACCTCGT CCGGCTGCCG
AAGCCAATGG TGGCCGCATG A
 
Protein sequence
MRGDDIGTER LFSYVSCEAR VSASHPLRPI RAIVDEVLEV LPADFEGMYA KTGRPSIAPG 
KLLRALLLQA FYSIRSERQL MEQMDYNLLF RRFVGLSMDA AVWDASVFTK NRDRLLEGDV
ATRFLAAVVA QARGRDLLSD EHFSVDGTLI DAWASMKSFR PRDDGEGPAG AGRNAERDFR
GEKRSNQTHA STTDPEAKLY RKANGQSSRM AFMGHGLMEN RNGLVVGALV TQATGTAERE
AALVLVDELK ATGRITLGAD KAYDARAFVQ ALRARKVTPH IARNEQINQA GERRRRSAID
GRTTRHPGYA ISLAVRKRIE EVFGWARTVG GPRKTRHKGT DRVGQAFTLT AATCNLVRLP
KPMVAA