Gene Sare_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2039 
Symbol 
ID5705693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2334888 
End bp2336270 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content67% 
IMG OID641271529 
Productaminotransferase class V 
Protein accessionYP_001536900 
Protein GI159037647 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000739402 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCGGCCC GGACCGCCTG TGTGGCCTCC GGACCCGCCG TCGGCTACGA AGGGATCACC 
GTGAGCCATA CCGTGGACTC CCCCGGCTCG GCCACAGTCC CGGCGTCGGT GCCACCGAGC
CCGTTGCGCA CCGCGTCGTG GAGTCGCATC CGGGAACTGT TCGCCCTGGA CCCGACGACC
GTGCATCTCA ACACCGGAAC CGTCGGGGCC ATGCCGTACG AGGTCCTCGA CACCGTCGAC
CGGGTCACCC GACAGTGGAC CGGCGGACTC CTGGACGTCT ACCGCCCGGC CATGTTCACC
GAGTACCGGG CCTTCATCGG CACGACGTTC GGAGTGGACG AGGACGAGAT CGTCATCTGC
CACAACGCGA CCGAGGGCGT CGCTCGGGTC ATCCACGGGC TGGACCTGCG CGCGAGTGAC
GAGGTGGTGA CCACCACCCA CGAGTGTTAC TCGGTGCTGT CCAACTTCAA CTTGCTGCGC
AACCGGCACG GCATTGTGGT ACGCACCGTC ACCCCGCCGT CGGGCCACGA CCTACGAGCT
GAGGAGATCG TCGATCTGGT CGAGTCGGCG ATCACGCCGC GTACCAAGGT GCTGTCGTTC
GCGGCGATCA CTCTCTTCAC CGGCACCATG TTTCCCGTCC GGCAGCTGTG CGAGCTGGCT
CACCGGTACG GCCTGACCAC CGTCATCGAC GGCGCCCTGA TCCCCGGCAT GTTCGACGTG
AACCTACGCG ACTACGGCGC CGACTTCATC ACCTGCTCCG GCTCGAAGTT CCAGTGTGGG
CCCCTCGGCA CCGGCCTCAT CTACGTCCGC AACAAGGTCG TCCCCGAGTC CAACCCGCTG
CCGTTGCCCA CCTTCTGGCC GCTCATCTCC ACCTGGTACC CGATGATGGG CACTCCGCCG
CCGCGTACCA CCAACGAGGT GGCCAGCTAC AACATGGGCG ACTACCTGCA AAGCGCCGGG
AGCGCCAACC TGGCTCGTGG CGCCGCGCTG ACCCGCGCCT TCGAGCTGTG GGACGACATC
GGGCGGGACC GCATCGAGCG GTACGTCATG GAGCTCGCCG AGTACGCGCG CGGCCGACTG
ATCGAGGCTT TCGGCGAGGA GGCCATGTAC TCCCCCGGCG CCGACCCACG GTTGCGCTCA
CCGCTGATCG CGTTCAACCC GTTCCGCCGC GCCGAGGACG CCTGGAACAT CAAGAAGTTC
GTCACCTTCG TCAAACGACT GGAGACCGAG CACCGGATCT GGACCCGTTG GACCGAGTTC
GACGTCCCCG GATCGCCGCA CCAGCACTAC GCGGCACGCA TCACCACGCA CCTGTTCAAC
ACGCGTGGAG AGATCGACCA CAGCGTCCGG ACGATGGTCC GCCTTGCCGA GGAGATGTCC
TGA
 
Protein sequence
MSARTACVAS GPAVGYEGIT VSHTVDSPGS ATVPASVPPS PLRTASWSRI RELFALDPTT 
VHLNTGTVGA MPYEVLDTVD RVTRQWTGGL LDVYRPAMFT EYRAFIGTTF GVDEDEIVIC
HNATEGVARV IHGLDLRASD EVVTTTHECY SVLSNFNLLR NRHGIVVRTV TPPSGHDLRA
EEIVDLVESA ITPRTKVLSF AAITLFTGTM FPVRQLCELA HRYGLTTVID GALIPGMFDV
NLRDYGADFI TCSGSKFQCG PLGTGLIYVR NKVVPESNPL PLPTFWPLIS TWYPMMGTPP
PRTTNEVASY NMGDYLQSAG SANLARGAAL TRAFELWDDI GRDRIERYVM ELAEYARGRL
IEAFGEEAMY SPGADPRLRS PLIAFNPFRR AEDAWNIKKF VTFVKRLETE HRIWTRWTEF
DVPGSPHQHY AARITTHLFN TRGEIDHSVR TMVRLAEEMS