Gene Sare_2729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2729 
Symbol 
ID5704470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3110474 
End bp3111610 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content73% 
IMG OID641272185 
Productglutamate--cysteine ligase GCS2 
Protein accessionYP_001537555 
Protein GI159038302 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.884579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.170085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTACC GTCCGGCCAC CGACGGTCCC GATCTGGCCA GCCTCACCCT TGGTGTCGAG 
GAGGAGTTTC TGCTGCTGGA CGCCGAGACG GGGGAGAGCA TGCCGGTTGC CGCTCGGGTG
CTCGACGGGT TGTCCGGTGT CGCCCACGCG CAGAGCCGGC GGGAGTTTCG GCACAGCATG
GTCGAGATGG TCACGCCGGT CCTGTCCGAC CTGGCGGAGC TGCGGCGGCA TCTGGTGGCG
CTGCGCACCG CCGCCGCTGA TGCGGCCGAG GCCGCTGGTG CCCGCCTTGT CGCGGTGGGC
GCGACCCCGG TGAACGAGAC GCACCGGACC GTACCGGACG AACCCCGGTA CCACGCGATG
TCCCGGCGCT TCGGGCCGGT CGCGCACGAC CCGGCCGTCT GTGGCTGCCA CGTGCACGTC
GGACTGCCGG ACCGGGAGCT GGCGGTCCAG GTCTGTAACC ACCTGCGCCC GTGGTTGCCG
GTCGTGCAGG CGATCACCGC CAACTCGCCG CTGCACGACG GCCAGGACAC CGGGCACGCG
AGCTGGCGGG CGATGCAGCT GGAGCGCTGG CCGAGTATCG GCCCCACCCC GTACTTCGAC
TCGGCCGCCG ACTACGACGC CACCGTGGCG GATCTGATCA AGGCGGGGAT CATGCTCGAC
GCCGGGATGG TCTACTGGTA CGTCCGACCG TCCGCCGCGT ATCCCACGGT CGAGATTCGG
GTCGGGGACG TCTGTCCCAC GGTCGACGAC ACGGTGCTGG TGGCCGGGCT GGTGCGGGCC
CTCGTCGCGA CCGTCGCCGC CGATGTCCAC GACGGCGCCC GTGCGCCGCG GATCCGTGGC
TGCCTGCTCT CCGCCGCCCA CTGGCGAGCC GCCCACGACG GGCTCGACGG CGACCTCGTC
GACCTGCGTA CCGGGCGCGC CCGGCCGGCC TGGGACCTGG TGGACGACCT GTTCGCACTC
GTCACGCCGG CGCTGGAACG CCAGGGTGAC CGGGCGTACG TGCGAGACCA GCTGGCCCGG
GTCCGGGCTG AAGGCACCGG CGCGGTACGG CAGCGCCGGA TCCTGGACCG CAGCGCCTGT
GACGTCCGCG CCGTGCTGGA CCACCTGGCC GCACAGACCC GACCCGCCCC GGTGTGA
 
Protein sequence
MTYRPATDGP DLASLTLGVE EEFLLLDAET GESMPVAARV LDGLSGVAHA QSRREFRHSM 
VEMVTPVLSD LAELRRHLVA LRTAAADAAE AAGARLVAVG ATPVNETHRT VPDEPRYHAM
SRRFGPVAHD PAVCGCHVHV GLPDRELAVQ VCNHLRPWLP VVQAITANSP LHDGQDTGHA
SWRAMQLERW PSIGPTPYFD SAADYDATVA DLIKAGIMLD AGMVYWYVRP SAAYPTVEIR
VGDVCPTVDD TVLVAGLVRA LVATVAADVH DGARAPRIRG CLLSAAHWRA AHDGLDGDLV
DLRTGRARPA WDLVDDLFAL VTPALERQGD RAYVRDQLAR VRAEGTGAVR QRRILDRSAC
DVRAVLDHLA AQTRPAPV