Gene Sala_0506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0506 
Symbol 
ID4081396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp523927 
End bp525147 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content65% 
IMG OID638008864 
Productaminotransferase 
Protein accessionYP_615560 
Protein GI103485999 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0194715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.65304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACG ATGAATTCTA TCGCATCAAG CGCCTGCCGC CCTATGTCAT CGCCGAAGTC 
AATGCGATGC GTGCGGCCGC GCGTGCCGCG GGCGAGGATA TCATCGACCT GGGGATGGGC
AACCCCGACC TGCCGCCGCC CGACCATGTG ATCGACAAAT TGTGCGAAGT CGCGCGCAAG
CCCGATGCCC ACGGCTATTC GCAGTCGAAA GGCATCCCCG GTCTGCGTCG CGCGCAGGCC
AATTATTACG GCCGCCGCTT CAACGTCGAC CTCGACCCCG AAAGCGAAGT GGTGGTGACG
ATGGGGTCGA AGGAGGGGCT CGCGAGCCTC GCGACCGCGA TCACCGGCCC CGGCGACGTC
GTGCTCGCGC CGAACCCCAG CTATCCCATT CACACCTTCG GCTTCATCAT CGCCGGCGCG
ACGATCCGCA GCGTGCCGAC GACGCCCGAC GAGAATTACT GGCGCGCGCT CGACCGCGCG
ATGGCCTTCA CCGTGCCGCG CCCGTCGATC CTGGTGGTCA ATTATCCCTC GAACCCGACC
GCCGAGGCGG TCGATCTCGC TTTTTACGAA CGTCTTGTCG CCTGGGCGAA GGAGAATAAG
GTCTGGGTGC TCAGCGATCT TGCCTATTCG GAGCTTTATT ACGACGGCAA CCCGACGCCC
TCGATCCTGC AGGTGCCGGG CGCGAAGGAC GTCGCGATCG AGTTCACGTC GATGTCCAAA
ACCTATTCGA TGGCGGGCTG GCGCATGGGC TTTGCGGTCG GCAACAAGCG GCTGATCGCC
GCGATGACGC GCGTCAAATC CTATCTCGAC TATGGCGCCT TCACCCCCAT TCAGGCCGCG
GCGTGCGCGG CGCTCAACGG GCCGCAGGAC ATCGTCGAGA AGAACCGCCA GCTCTATCAG
AAGCGCCGCG ACGTGATGGT CGAAAGCTTC GGCCGAGCGG GATGGGACAT CCCCAGCCCG
CCCGCGTCGA TGTTCGCCTG GGCGCCGCTG CCGCCCGCGC TCAGGGAGAT GGGCAGCCTC
GAGTTTTCCA AACAGCTGCT GACCCACGCC AAGGTCGCGG TCGCCCCCGG CGTTGGTTAT
GGCGAGGATG GCGAGGGCTT TGTCCGCATC GCGATGGTCG AAAATGAACA GCGCATCCGG
CAGGCGGCGC GCAACATCCG CAAGTTTCTG GCCATGCACG GCGTGAACAC ACCTTCGATC
GCCGCCGGCG GCGCGGGATA A
 
Protein sequence
MSDDEFYRIK RLPPYVIAEV NAMRAAARAA GEDIIDLGMG NPDLPPPDHV IDKLCEVARK 
PDAHGYSQSK GIPGLRRAQA NYYGRRFNVD LDPESEVVVT MGSKEGLASL ATAITGPGDV
VLAPNPSYPI HTFGFIIAGA TIRSVPTTPD ENYWRALDRA MAFTVPRPSI LVVNYPSNPT
AEAVDLAFYE RLVAWAKENK VWVLSDLAYS ELYYDGNPTP SILQVPGAKD VAIEFTSMSK
TYSMAGWRMG FAVGNKRLIA AMTRVKSYLD YGAFTPIQAA ACAALNGPQD IVEKNRQLYQ
KRRDVMVESF GRAGWDIPSP PASMFAWAPL PPALREMGSL EFSKQLLTHA KVAVAPGVGY
GEDGEGFVRI AMVENEQRIR QAARNIRKFL AMHGVNTPSI AAGGAG