Gene Sare_1821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1821 
Symbol 
ID5706467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2098714 
End bp2099919 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content65% 
IMG OID641271323 
Productaminodeoxychorismate lyase 
Protein accessionYP_001536698 
Protein GI159037445 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000914518 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGACG ATCTGGACCC CGAGTTCGAT GCGGACCGGG GAGAGAAGGG CCGGCATCGG 
CGCCGCTACG TGCGCAGGCG CCAGCGTCAG CGCCGGAGCG GTTCGGGTGG TGGGCGTGGC
AAGACCGCCC TGGCTCTGTT GCTCACCCTG GTTTTGCTCG GCGGCCTCGG CGGTGGTGCC
TTCTACGGCT TCGAACGGAT CCAGAACTTC CTCGGCACGC CGGATTACGA CGGTTCTGGC
ACCGAGGCGG TGACGGTCGA GATCATGGAA GGGGCATTGA TCGCCGACAT GGCGGTCACG
CTCTACGAGG CCGGGGTCGT CAAGAGTACC AAGGCTTTCA TCGAGGCCGC GGAGGATGAC
GGCCGCAGCA AGACCATCCA GCCAGGCCAG TACCAGTTGC GCAGGCAGAT GAGTGGCGCC
AGCGCCGTGG CCGCGCTGCT GGACCTGACG AACCGGGTCG TCAACGGGAT CACCATTCCC
GAGGGGCGCA CCGCGAAGAG CGTCTACAAG CTCCTCTCCG AGAAGACCAA CGTCCCGGTC
ACGGAGTTCG AGGCGGCGGC GAAGGACCCG ATCGCGCTCG GTGTCCCGGA ATGGTGGTTC
ACGCGCACGG ACGACCGGAA GGTCGAGCCG TCGATCGAGG GATTCCTCTT CCCCGACACC
TACGAGTTTC CCCCGAAGTC AACGGCTGAG TCGATCCTTG GGCTGATGGT GGAGCGGTTC
CTCACCGTCG CCGAGGAGCT GCGGTTCGTC GACCGGGTGC AGAACGAACG GCAGATCGCG
CCGTACGAGG CGCTGATCGT CGCGTCGCTC GCCCAAGCTG AGGCGGGTGT TCCGGGGGAT
CTCGGCAAGG TCGCCCGGGT CGCCTACAAC CGGGTCTACG GCGACTTCCC GTGCAACTGC
CTGGAGATGG ACGTCACGAT CAACTACCAC CTGGAGTTGA CCGGCCAGAA GACCAAGACC
TCGGCCGAGA TGACGGAGGA CGAGCTGCTC GACACAAAGA GCCCGTACAG CCGCAAGCTT
CGGGGTCTGA TTCCCACACC GATCAACAAT CCGGGTCAGT TGGCCCTGGA GGGCGCCATG
GACCCGCCGC CGGGTAAGTG GCTGTACTTC GTTGCGATCA ACAAGGAGGG ACAGTCCGCC
TTCGCGGAGA CCTACGAGGA GCAGCTGCGC AACGAGGCAA AGGCGAGGGA GGCGGGTGTC
ATCTGA
 
Protein sequence
MIDDLDPEFD ADRGEKGRHR RRYVRRRQRQ RRSGSGGGRG KTALALLLTL VLLGGLGGGA 
FYGFERIQNF LGTPDYDGSG TEAVTVEIME GALIADMAVT LYEAGVVKST KAFIEAAEDD
GRSKTIQPGQ YQLRRQMSGA SAVAALLDLT NRVVNGITIP EGRTAKSVYK LLSEKTNVPV
TEFEAAAKDP IALGVPEWWF TRTDDRKVEP SIEGFLFPDT YEFPPKSTAE SILGLMVERF
LTVAEELRFV DRVQNERQIA PYEALIVASL AQAEAGVPGD LGKVARVAYN RVYGDFPCNC
LEMDVTINYH LELTGQKTKT SAEMTEDELL DTKSPYSRKL RGLIPTPINN PGQLALEGAM
DPPPGKWLYF VAINKEGQSA FAETYEEQLR NEAKAREAGV I