Gene Sala_3167 details


Gene Information

Locus tag: Sala_3167
Symbol:
ID: 4082503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 3317407
End bp: 3318717
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 68%
IMG OID: 638011552
Product: amino acid permease-associated region
Protein accession: YP_618203
Protein GI: 103488642
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
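
The listed lengths follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the 1311 bp figure) and that the final codon is a stop codon that does not count toward the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3317407
end_bp = 3318717

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the last codon is the stop codon
# and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # gene length in base pairs
print(protein_length, "aa")  # protein length in amino acids
```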


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGATC CCGCAAATCG CCACGAAAGG CCGCGCCGCA AGCTCGGCCT TTCGATGGCG 
ATTGCGCTGG TCATGGGCAA TATGATCGGC TCGGGGGTGT TTCTGCTGCC CGCGAGCCTC
GCCCCCTTTG GCTGGAACGG TGTCGCGGGC TGGGCGATCA CGATAGGCGG CGCGCTCGCA
CTCGCCTTCG TCCTCGCGCG GTTGACCGCC CTCCACCCCG ACGCGGGCGG GCCAACCGGC
TTCGTCGAGC GCGCCTTTGG CCGCATCCCC AGTTTCATGA TCGGCTGGGC CTATTGGGTG
TCGGTGTGGA CCGCGAACGT GACGCTCGCG GTCGCGGCGG TGAGCTTCCT CAGCCTGTTC
GTGCCGGCAC TGGGGCAGCA TACGGCGCTG TCGACGATCG CGCTGATCTG GATCGTCACC
GCGATCAACT GGCGCGGCGC GCGCGCGGCG GGACAGTTTC AGGTCGTGAC CCTGCTCATC
AAGCTGATCC CGCTCGTCAC CGTCATCATC CTGATCCCCA TCGCCTTTGG CCGCAGCGAG
CCCGTCGCGC TCACCCCCTT TCCCGCCGAC GGGCTGTCGC TCGCGGCCGT CAGCGGGTCG
GCGATCCTGA CGCTCTGGGC GCTGCTGGGT TTTGAATCGG CGAGCGTCGC CGCCGACAAG
GTCGCCAATC CCGCCGTCAC CATCCCGCGC GCGACCATCG TCGGCACGCT CGCGACGGGC
ATCCTCTATC TGATCGTCTG TTCGGCGATC GCGCTGATGC TGCCCGCGGC GGAAGTCGCG
AAATCGGAGG CCCCCTTTTC GCTGTTCGTC GAAACCTGGT GGGGCCGCGA GCCTGCGCTG
TTCATCGGCG CCTTTGCGGC GGTCAGCGCG CTGGGCACGC TGAACGGCTG GACGCTGATC
CAGGCGGAGC TGCCCGCGAC GCTCGCACGG CAGGGGTTGC TCCCTTCATG GTTCGGGCGT
GAGAACCGCC ATGGCACGCC GACCGCGGCA CTGCTGCTGT CGAGCGCGAT CGCCACCGCC
TGCGTCCTCC TCAACAGCAG CAAGTCGACG AGCGAGATGT TCACCTTCAT GGCGGTGCTC
TCGACCTCGG TGACCCTGTG GCTCTACCTC GCCTGCGCCG CCGCGGCGCT GCGGATGCGC
GTCGCGATCC CGGTCGCGCT GATCGGCCTC GTCTATGCCG TCTGGACTTT GTGGGGTGCG
GGGATCGGTG TCAGCGCGAT GAGCCTCATA TTGATGGCCG CAGGGTTGCC GCTCTACGCT
TGGACCATGC TGTCAGCGCC AGCGGGGCGC GAAGAGCCCC CGGTCGCGTA G
 
Protein sequence
MSDPANRHER PRRKLGLSMA IALVMGNMIG SGVFLLPASL APFGWNGVAG WAITIGGALA 
LAFVLARLTA LHPDAGGPTG FVERAFGRIP SFMIGWAYWV SVWTANVTLA VAAVSFLSLF
VPALGQHTAL STIALIWIVT AINWRGARAA GQFQVVTLLI KLIPLVTVII LIPIAFGRSE
PVALTPFPAD GLSLAAVSGS AILTLWALLG FESASVAADK VANPAVTIPR ATIVGTLATG
ILYLIVCSAI ALMLPAAEVA KSEAPFSLFV ETWWGREPAL FIGAFAAVSA LGTLNGWTLI
QAELPATLAR QGLLPSWFGR ENRHGTPTAA LLLSSAIATA CVLLNSSKST SEMFTFMAVL
STSVTLWLYL ACAAAALRMR VAIPVALIGL VYAVWTLWGA GIGVSAMSLI LMAAGLPLYA
WTMLSAPAGR EEPPVA
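
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG can act as a start codon and is read as methionine at the initiation position. The following is a minimal sketch of that translation plus a GC-content helper. The function names are hypothetical, the start-codon set is a deliberate simplification (only ATG, GTG and TTG), and the codon table is the standard genetic code, whose amino acid assignments table 11 shares.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G). Translation
# table 11 uses the same amino acid assignments and differs in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Simplification (assumption): only three common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating GTG or TTG is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
first_groups = "GTGAGCGATC CCGCAAATCG CCACGAAAGG"
print(translate(first_groups))  # MSDPANRHER
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the protein sequence and the GC content field listed in this record.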