Gene Sala_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1920 
Symbol 
ID4082777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2023217 
End bp2024695 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content70% 
IMG OID638010297 
ProductABC transporter related 
Protein accessionYP_616965 
Protein GI103487404 
COG category 
COG ID 
TIGRFAM ID[TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGCT GGCAGCGCCA TCTGGTCACG CTGGCCTTGC TGGCGGCGGT GATCCTCGCG 
CTCTTCTGGC GCGACGCCGC CGACATGGCG GGCATCTGGT GGAACAGTTC GACCTTTACC
CATTGCCTGC TGATGGTGCC GCTGATCGGC TGGCTGGTCG CGCAGCGGAT CGACCTGTTG
CGCCCGCTGG CGCCGACCTT CTGGTGGCCC GCGTTGCTGT GGATGACGGG GGCGGGGTGC
GTCTGGCTCG TCGGCGAGGC GGCGGGGGTG GCGTTGTTCC GCCAGCTCGG CCTCGTGCTG
ATGCTCCAGG GCGCGGTGGG CGTCGCGCTC GGCGAAAAGC TGGTGCGCGG GCTGCTCTTC
CCGCTCGCCT ATGCGCTGCT GCTCGTTCCC TTCGGCGAGG AGCTGGTGCC GCTGCTCCAG
ACCTTTACCG CACATATCAG CGTCGCGCTG CTCCACCTGT CGGGCTTCGC TGCCGAGATG
CAGGGCGTGT TCATCACCAC GCGCGCGGGT TTTTTCGAGG TGGCCGAGGA ATGTTCGGGA
GTCAATTTCC TGATCGCGAT GCTCGCCTAT GCCGCGTTCG CGGCGCATCT TTGCTTCAAG
AGCTGGACGC GGCGGATCGT CTTTGTCGTC GCCGCGCTCG CGACGACGAT CCTGGCCAAT
GCGCTGCGCG CCTATGGGAC GATGCTCGCG GCCGAGGTCT GGGGGATCGA GGCGGCGGGC
GGGATCGACC ATATTGTCTA TGGCTGGATC TTCTTCGGCC TCGTCATCCT GATCGTGATG
CTGGCGGCGC GGCGCTGGTT CGACCGGCCC GCGAACGACG CCGCCGTGGA TGTGCGTGGG
CTGGAGGGGC CGATCCGTTT TGCGGGGCCG GGCCGCGCGG TGCTGCCCGC CGCCTTGCTC
TTGCCGCTGC TGTTCGCCGC ATGGGGTCTG CTCGTCGCCG GGCGCAGCGC GCCGTTGCCG
CCCACCATGG CGGTCGAGGC GCCGCCGGGC TGGCGCGAAG CGCGTGTCGG CGGCATCCCC
TGGACGCCGC GCTTCGACGG CGCCGACCAG CGGCTTTTGC GGCACTTTGC GAACGCGAAG
GGGCAGGTCG TGACCGTCGC GATCGGCGGT TATGAGCGCC AGGCCGAGGG GCGCGAGGTC
GTCGCCTTCG GGCAGGGCGC GGTCGATCCC GACAGCCGAT GGGCGTGGAG CGCGGCGCTG
CCCGCCGTCG ATGGGGGCAA GACCGAACGC CTGCTGCACC CCGGCCCGGT GCTGCGCGAC
GCCGCGACCT GGTATGTCGT CGGCGGCGAC GTGACGGGCA GCGCGCGTTC GGCCAAGCTG
GCGGGGCTGA AAGCGCGCCT TGCCGGCGGC GACCCGCGCG CGCTGACGCT GATCGTGTCG
AGCGAGACGG CGCAGGGCGG GCGCGATGCG ATCAGGGATT TCGTTTCCGC GTCGGGCGGA
GCCAGGGCGA TGGCTGACCG CGCGCTCAAA AGCCGCTAG
 
Protein sequence
MTRWQRHLVT LALLAAVILA LFWRDAADMA GIWWNSSTFT HCLLMVPLIG WLVAQRIDLL 
RPLAPTFWWP ALLWMTGAGC VWLVGEAAGV ALFRQLGLVL MLQGAVGVAL GEKLVRGLLF
PLAYALLLVP FGEELVPLLQ TFTAHISVAL LHLSGFAAEM QGVFITTRAG FFEVAEECSG
VNFLIAMLAY AAFAAHLCFK SWTRRIVFVV AALATTILAN ALRAYGTMLA AEVWGIEAAG
GIDHIVYGWI FFGLVILIVM LAARRWFDRP ANDAAVDVRG LEGPIRFAGP GRAVLPAALL
LPLLFAAWGL LVAGRSAPLP PTMAVEAPPG WREARVGGIP WTPRFDGADQ RLLRHFANAK
GQVVTVAIGG YERQAEGREV VAFGQGAVDP DSRWAWSAAL PAVDGGKTER LLHPGPVLRD
AATWYVVGGD VTGSARSAKL AGLKARLAGG DPRALTLIVS SETAQGGRDA IRDFVSASGG
ARAMADRALK SR