Gene Sare_2958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2958 
Symbol 
ID5707812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3355974 
End bp3357389 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content70% 
IMG OID641272407 
Productadenylosuccinate lyase 
Protein accessionYP_001537775 
Protein GI159038522 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.121926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00625137 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGTCA CGCACCGCCT CGTCCATTCC GTCGATAGCT GCGCTCACGA GCGGGGACAC 
ATCACCGACT CACGGTTCTA CGGCAACCGG TACGCCACGT CCGGCAGCCG GCGCATCTAC
TGCGACGTCT GCCGCAAGCA GCGCTGGCTG GACATCGAGG CGGCACTGGC CCAGGCCCAA
GGCGAGCTGG GCATGATTCC CGCTCGGGCC GTGGCGGGGA TCGTGTCCGC GGCCCGACTC
GAGTGCATCG ACCTCGACGC GGTGCAGGCG GAGATCGACC GGTCGGGGCA CTCCCTGGTC
GGGCTGCTCC GGGTCCTCCA GGTCGCCTGC CCTGACGACA CCGGCGAGTA CATCCACTTC
GGTGCGACCA CGCAGGACAT TCAGGACACG GGTCAGGCCC TCGAGATGCG GGACACCCTG
GACGAGTTGA CCCGGGAGAT CGCCGCGATC CTCGCCAGTC TGGTCGAGCT GGCCGAACAG
CACGCCGGGA CGGTCGCGGT CGGACGGACG CACGCCCGGG CGGCGCTGCC GATGAGCTTC
GGCCTCAAGG TCGCTAGCTG GATCGACGAG CTGTTACGGC ACACCGAACG GCTCGCCACG
GCACGATCCC GCGTCGTGGT GGCCCAGCTG TTCGGCGGCG CCGGCACGAT GGCCGGGTTC
GGCGGCGGCG GGGTTGTCCT GCTGGAACGC TTCGCCGCCC GCCTCGGTCT CGCCGTGCCC
ACCCTCGGCT GGCACGTCGC CCGCGACCGG GTGGTCGAGT TCGTCACCAC GCTGGCCATG
GTCAGCGGCA CGCTGGGCCG CGTCGCCGAC GAGATCCGCA CCATCGGTCG GCCCGAGTTC
GGTGAGGTCA CCGAACCCTG GCGGTACGGC AAGGTCGGGT CCAGCACGAT GCCGCACAAG
CGGAACCCGG AGCGCTGTGA GCAGGTCGTC GTGATGGCCA AGCTGGCGGC GGCCCAGGCG
GGGATCGCCT TCACCGCGAT GGTCGGTGAC CACGAGCGGG ACGCGCGTGC GCTGCGGGTG
GAGTGGGCGT GTGTCCCGGA CGTCTCGCAC TACACCCTGG CGGCCTGCGA GATCGTCCGG
GAACTGGTCA CGGGCCTGAC CGTGCACGAG GACCGGCTGC GGGTCAACGC GCAGGAGGTT
GCCGACCAGC TGGCCACCGA ACGGCTGATG CTGGCGCTCG GTCGACAGCT CGGCAAGCAG
ACCGCGCACG AGCGGGTCTA CGAACTCAGC CAGACCGCCC ACGACACCGG CCGCCCGCTG
CGCCACCTCT TCGACGACTG CGCCGACCTG CGCGACCTCG TCGACGAACG GGAGCTGGAT
GTCATCTTCG ACCCCAGTCA GTACCTGGGC GCCTCAACCG ACCTGACCCA CCGGGCCCTC
GCCGAGGCCC GTAAGTGGCT CGGGGCGCGC GGGTGA
 
Protein sequence
MSVTHRLVHS VDSCAHERGH ITDSRFYGNR YATSGSRRIY CDVCRKQRWL DIEAALAQAQ 
GELGMIPARA VAGIVSAARL ECIDLDAVQA EIDRSGHSLV GLLRVLQVAC PDDTGEYIHF
GATTQDIQDT GQALEMRDTL DELTREIAAI LASLVELAEQ HAGTVAVGRT HARAALPMSF
GLKVASWIDE LLRHTERLAT ARSRVVVAQL FGGAGTMAGF GGGGVVLLER FAARLGLAVP
TLGWHVARDR VVEFVTTLAM VSGTLGRVAD EIRTIGRPEF GEVTEPWRYG KVGSSTMPHK
RNPERCEQVV VMAKLAAAQA GIAFTAMVGD HERDARALRV EWACVPDVSH YTLAACEIVR
ELVTGLTVHE DRLRVNAQEV ADQLATERLM LALGRQLGKQ TAHERVYELS QTAHDTGRPL
RHLFDDCADL RDLVDERELD VIFDPSQYLG ASTDLTHRAL AEARKWLGAR G