Gene Sare_2452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2452 
Symbol 
ID5707697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2812244 
End bp2813404 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID641271919 
Productaminotransferase class I and II 
Protein accessionYP_001537290 
Protein GI159038037 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCA GTCGTCTACG CGACATCCCA GGTATCGGCG TTGACGTGGT CGGCGACGCC 
GCGGACGCGG TAGCCGACCC GGACTTTCTC CGTCTGGAGA ATCTGGACAC CGACGTGCGG
CCGCCCACCG TGGCGGTCAC CTCGACCCGA GCCGCCATCG ATGACGATGC GGCCAACAGC
TACCTGCCCT TCCAGGGGCA CCGTTCCCTT CGTTCGGCTG CCACGGCTCA TGTGGGACGC
ATCGCTGGAC GCAGATTTGA TCCCGGCACC GAGTGCGTCA GCGTCGCCGG CGGGCTGAAC
GGCATCTTCA ACGCGCTACT TGCCACGGTG GAACCGGGCC AGGAGGTGGT GCTGGTCGAC
CCCATCTACG CCGGCCTGGT CAACCGGGTC CGGCTTGCCG GTGGGGTGCC TCGCTTCGTG
CCAGCCCGCG CCACGCCGGA TGGCTGGTCG GTGGATCCGC AACGCCTGGC CAGCGCTGTT
GGACCCGATA CCGCTGCCGT GCTGATGATG GGCCCGGCCA TGCCGTCCGG TCTGGTGTTG
GACACGCAGC ACTGGTCGGC GTTGTCCGAT GCCTGCGAGA GACACAACGC CTGGCTGATC
TATGACGCGG CGATGGAGAG AATCCGGTTC GACGGTCGCC GCCCCAGCCA TCCGGCGGCC
CATGACGGCC TCGCCGGACG TACCATCACG GTGGGTTCGG CATCCAAGGA ACTGCGGCTG
ATCGGCTGGC GGGTCGGTTG GGTCGTGGGG CCGGCGGACA TCCTCGCCGA CATCAAGCTG
GTTGGCCTGA CAAACGTGGT CTGCCAGGTC GGGCTGGCGC AAGGTGCGGT GGCTGCCGCG
CTCGACGCAC CCGATGCCGA CGCCGATGTT GCCGCCGCCA CCCGTGAGTG GCAGCGGCGC
TGCGACACGA TCCTCAACCA GCTTTCGGAC TATCCGATCA TTCGGCCACA CGGCGGCTGG
TCGCTCCTCG TGGACACCCG GCCCCTGGGC CTGACGCCCA CCGCGCTTGC CCGGCTACTG
TTCGACCGGG CCAAGGTCGC GGCCACGGCG ATGGATGGCT GGGGCCCCAG CGGCGAGCAC
TACCTGCGGA TCGTGTTCGC CAACGAACCG GTCGAGCGGC TCACCAGCCT GGCGGACCGC
TTCCGCCAGG CCATCGGGTG A
 
Protein sequence
MTVSRLRDIP GIGVDVVGDA ADAVADPDFL RLENLDTDVR PPTVAVTSTR AAIDDDAANS 
YLPFQGHRSL RSAATAHVGR IAGRRFDPGT ECVSVAGGLN GIFNALLATV EPGQEVVLVD
PIYAGLVNRV RLAGGVPRFV PARATPDGWS VDPQRLASAV GPDTAAVLMM GPAMPSGLVL
DTQHWSALSD ACERHNAWLI YDAAMERIRF DGRRPSHPAA HDGLAGRTIT VGSASKELRL
IGWRVGWVVG PADILADIKL VGLTNVVCQV GLAQGAVAAA LDAPDADADV AAATREWQRR
CDTILNQLSD YPIIRPHGGW SLLVDTRPLG LTPTALARLL FDRAKVAATA MDGWGPSGEH
YLRIVFANEP VERLTSLADR FRQAIG