Gene Sare_3976 details

Gene Information

Locus tag: Sare_3976
Symbol:
ID: 5705253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4517530
End bp: 4518591
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 72%
IMG OID: 641273401
Product: hypothetical protein
Protein accession: YP_001538757
Protein GI: 159039504
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2706] 3-carboxymuconate cyclase
TIGRFAM ID:
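
The coordinates and lengths listed in this record are mutually consistent, and a short sketch can make the arithmetic explicit. It assumes that the start and end positions are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is assumed to be a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(4517530, 4518591)
print(length)                     # 1062
print(protein_length_aa(length))  # 353
```

Both values agree with the Gene Length and Protein Length fields reported for Sare_3976.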


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0026302
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCGGGTC AGGGTGCGAT CGTCTATGTC GGTTGCTACA CGGTGGGTGC CGGAGGCCAC 
GGCGAGGGGA TCGTTGCCGC CCGTCGCGAC CTTGTCTCGG GTGCGCTCAC CCCGCTCGGC
ACGGCCGCGG CGACTCCGGC GCCGTCGTTC CTCGCCCGGC ACCCGGAACT GCCGGTCCTG
TACGCGGTCA ACGAGGTGAC CGACGGTGCC GTCAGCGCGT TCCGGGTCGC CTCGGACGGG
GGCCTGACCG CGCTCGGCAG CCGACCCACG GGGGGCGCCG AGCCGTGCCA CCTCGCGGTT
GCCCCGAACG GCCGGCACCT TTTCGTGGCC AACTACGGTG GCGGGAGCGT GGCGGTGTTT
CCGCTCGATG GGCAGGGGAT GCCCGGGGAA CGCACTGACC TGGTTCAGCA TGAGGGCCAC
GGCCTGGATC CGGAGCGGCA GCAGACGGCG CACACCCACA TGGTCGCCCC GGGCCGGGAC
GGTTGGCCGC TGTTCGTGGT CGATCTCGGC ACTGACTCGG TCTACCTGTA CGAGTTCGAC
GCCGCGCTGG GGCGGCTGGC GCCCCGGGCT TGCCGGGTGC CCACCGCCGC CGGTACCGGT
CCACGGCATG TGGCCCGCCA CCCGGACGGG CGGCGCTGCT GGCTCGTCGG TGAGCTGGAC
GGTTCTGTCG TCACCTACGA GTTCACCACC GAGGGTGCCC TGCGTCAGCG CGGTCGGGTG
TCAGCCAGCG AGCGGCCGGG GCACATACAG CCCTCGGAGA TCGCGGTCGG GCCGGACGGG
CGGTTCCTCT ACGTCGCGAA CAGGGGTGTC GGCACGATCG CCGTCTTCGC GCTCGACGGC
GAACTGCCGG TGCGGGTCGC CGAGGTCGAC TCCGGCGGGG AGTGGCCCCG GCATTTCGCG
CTGGTGGGCC CCAACCTGTA CGTGGCGGAC GAGCGGGCCG ACCTGATCGC GGTGTTTCGG
GTTGACCCGG TGACCGGTGT GCCCGTACCG GCCGCTGAGC CGGTTGCTGT CCCGAGCCCC
ACCTGTGTCC TGCCCTGGAC GGGACACGAC GACGCATCGT GA
 
Protein sequence
MAGQGAIVYV GCYTVGAGGH GEGIVAARRD LVSGALTPLG TAAATPAPSF LARHPELPVL 
YAVNEVTDGA VSAFRVASDG GLTALGSRPT GGAEPCHLAV APNGRHLFVA NYGGGSVAVF
PLDGQGMPGE RTDLVQHEGH GLDPERQQTA HTHMVAPGRD GWPLFVVDLG TDSVYLYEFD
AALGRLAPRA CRVPTAAGTG PRHVARHPDG RRCWLVGELD GSVVTYEFTT EGALRQRGRV
SASERPGHIQ PSEIAVGPDG RFLYVANRGV GTIAVFALDG ELPVRVAEVD SGGEWPRHFA
LVGPNLYVAD ERADLIAVFR VDPVTGVPVP AAEPVAVPSP TCVLPWTGHD DAS