Gene Sare_1106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1106 
Symbol 
ID5706671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1244276 
End bp1245451 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content74% 
IMG OID641270621 
Productaminotransferase class V 
Protein accessionYP_001536005 
Protein GI159036752 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00753492 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCATACC TGGATCACGC GGCGACCACT CCGATGCTCG ACGTGGCACT CGAGGCGTAC 
GTCGCCACCG CCCGCGAGGT CGGCAACGCA TCCGCCCTGC ACGTGGCGGG CCGCCATGCC
CGCCGCCGAG TCGAGGAGTC GCGTGAGCGG GTGGCCGCCG CGCTGGGCGC CCGCCCCGCC
GAGGTCATCT TCACGGGCGG TGGCACGGAA AGCGACAACC TCGCGGTCAA GGGCATCTTC
TGGGCCCGTC GGTCCGCCGA CCGTCGGCGG ACGCGGCTGG TGTCCAGCGC GGTGGAGCAT
CACGCGGTGC TCGACAGTGT CGACTGGCTG GCCGCGCACG AGAGCGCCGA GGTGGGTTGG
CTGCCGGTCG ATGCCCTCGG CCGGGTCACC CCGCAGCGGC TCCGCGCCGA GTTGGCCGCG
TCCGCCGATC GGGTGGCCGT GGTCACGACG ATGTGGGCCA GCAACGAGGT GGGTACGATC
CAGCCGGTCA CCGAACTGGC CGAGGTCGCG GCCGAGTACG GGGTGCCCTT TCACACCGAC
GCGATCCAGG CGGTCGGCCA GGTGGCGGTG GACTTCGCCG CCAGTGGCGT CTCGGCGCTC
ACGGTGACCG GGCACAAGCT CGGCGGTCCC GCCGGGGTGG GCGCACTGGT GCTCGCCCGC
GACGTCGCCG CGACCCCGCT CCTGCACGGT GGTGGCCAGG AACGGGACGT CCGTTCGGGA
ACCCTGGACA CGGCCGGGAT CGTCGCCTTC GCCGCCGCGC TGGAGGCCGC GGTCCAGCAC
CAGCAGGAGT ACGCGACCCG CGTCGCCGCC CTTCGGGACG ACCTCGTGGC ACGGGTGCGG
CAGGTGGTGC CGGAGGCGGT GCTCAACGGT GACCCAGCCG GACGGCTGCC CGGCAATGCC
CACTTCTCGT TCCCCGGGTG CGAGGGCGAT GCGCTGCTGC TCCTCCTCGA CGCGCAGGGC
ATCGCCTGCT CCACCGGCTC GGCGTGCTCG GCCGGCGTCG CCCAGCCGAG CCACGTGCTG
CTCGCGATGG GCGCCGATGG CGCCCGCGCC CGCTCCTCAC TGCGCTTCAC CCTCGGCCAC
ACCAGCACAC CGGAGGAGGT CGACGCGCTG ATCGCGGCCC TACCGGAGGC GGTCGATCGA
GCCCGTCGCG CCGGCGGCCT CCGCGCTCCG CGCTGA
 
Protein sequence
MAYLDHAATT PMLDVALEAY VATAREVGNA SALHVAGRHA RRRVEESRER VAAALGARPA 
EVIFTGGGTE SDNLAVKGIF WARRSADRRR TRLVSSAVEH HAVLDSVDWL AAHESAEVGW
LPVDALGRVT PQRLRAELAA SADRVAVVTT MWASNEVGTI QPVTELAEVA AEYGVPFHTD
AIQAVGQVAV DFAASGVSAL TVTGHKLGGP AGVGALVLAR DVAATPLLHG GGQERDVRSG
TLDTAGIVAF AAALEAAVQH QQEYATRVAA LRDDLVARVR QVVPEAVLNG DPAGRLPGNA
HFSFPGCEGD ALLLLLDAQG IACSTGSACS AGVAQPSHVL LAMGADGARA RSSLRFTLGH
TSTPEEVDAL IAALPEAVDR ARRAGGLRAP R