Gene Sare_0745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0745 
Symbol 
ID5707777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp828194 
End bp829825 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content70% 
IMG OID641270264 
Productalpha amylase catalytic region 
Protein accessionYP_001535655 
Protein GI159036402 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.215044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000627206 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCACCC GCAATCCCAC CCAGCTGGCC TCCGACGATG ACTGGTGGCG CTCTGCGGCC 
ATCTACCAGG TCTACGTCCG CAGCTTCGCC GACGCGAACG GCGACGGGAT CGGCGACCTG
GCGGGCCTGC GGGAACGCCT GCCCTATCTG CGCGACCTCG GAGTCGACGC GCTCTGGTTG
ACCCCCTTCT ACCCCTCGCC GATGATCGAC GGCGGCTACG ACGTCGCCGA CTACCGCGAC
GTGGACCCGC TGTTCGGCAC CCTCACCGAC TTCGACAACG TGATCGTCGA CGCGCACGCG
CTGGGGCTGC GAATCATCGT CGACCTGGTA CCCAACCACA CGTCGAGCGA GCATCCCTGG
TTTCAGGCCG CCCGCACGGC CGCCCCCGGC TCTCCCGAGC GCGCGAGGTA CGTCTTCGCC
GAAGGGCGAG GCGTGGCGGG CGAGTTGCCG CCGAACGACT GGGAGAGCGT CTTCGGCGGT
CCGGCCTGGA CCCAGCTCAC CGACGGACAG TGGTACCTGC ACCTGTTCGA CCCGGCGCAG
CCGGACCTGA ACTGGCGAAA TCCCGAGGTC CGCGCCGAGT TCGCGGACGT GCTCCGATTC
TGGCTGGACC GTGGTGTCGA CGGCTTCCGG ATCGACGTCG CCCACGGGTT GATCAAGGCG
GAGGGCCTGC CGGACGTCGG CTTCGGCCAG CTGACCGGCC AACGACAGGT CGAACTGCTC
GGCAAGCGCC GGCTCCCCTA CTTCGACCAG GACGAGGTAC ACGACATCTA CCGCACCTGG
CGACCGATTC TGGACAGCTA CCCAGGCAGC CGAATGGCAG TGGCCGAGGC GTGGGCGGAG
ACGCCCCAAC GGCTCGCCCG CTACATCGGC CCGGACGAGC TGCACCAGGC GTTCAGCTTC
GACTTCCTCG ACGCCCACTG GTCGGCCGAC TCCTTCCGCA AGGTCATCGA CACCGCGCTC
GCCGAGTCCG TGATCGTCGG CGCACCCACC ACCTGGGTGC TGTCCAACCA TGACAAGCAG
CGACACGTCA CCCGCTACGG AGATGGCCCG GAAGGGTTGC GTCGGGCACG TGCCGCGGCT
CTGCTGATGC TCGCGCTACC TGGCTGCGCC TACCTCTACC AGGGTGAGGA ACTGGGCCTG
CCCGAGGTGC TCGACCTACC CGACGACCTG CGACAGGACC CGCAGTACCT GCGGACCGGC
CAGAGCCGGG ACGGCTGTCG GGTGCCGATC CCGTGGAACG GCGAGCTGGC CCCGTACGGC
TTCGGCCCGG CCGGCAGCGC GTTGAGCTGG CTGCCGGCCC CGGCGACCTG GCGCGAGCTG
TCGGTGCGGG CCCAGACCGG CGTCTCCGAC TCCACCCTGG AGCTGTACCG CACTGCGCTA
CGGATTCGTC GGACCCACCC GGCGCTGTCC GTATCCTCGG CCAGGATCAC CTGGCAGGAG
ACCGATCCGG GAATCCTCGC CTTCACCCGC TCGGCCGCCG GCACGGCGCT CACCTGCGTG
GTCAACATCA GCGACGAACC GGCCGCCGTC GCCGAGTACG GCGAGCCGCT CGTCGCCAGC
ACGGCGCTCA CCGAACGGGA CGCCGGCTAC CTGGTTCCGG TCGACGCCGC CGCCTGGTTC
GAACGCCGCT GA
 
Protein sequence
MTTRNPTQLA SDDDWWRSAA IYQVYVRSFA DANGDGIGDL AGLRERLPYL RDLGVDALWL 
TPFYPSPMID GGYDVADYRD VDPLFGTLTD FDNVIVDAHA LGLRIIVDLV PNHTSSEHPW
FQAARTAAPG SPERARYVFA EGRGVAGELP PNDWESVFGG PAWTQLTDGQ WYLHLFDPAQ
PDLNWRNPEV RAEFADVLRF WLDRGVDGFR IDVAHGLIKA EGLPDVGFGQ LTGQRQVELL
GKRRLPYFDQ DEVHDIYRTW RPILDSYPGS RMAVAEAWAE TPQRLARYIG PDELHQAFSF
DFLDAHWSAD SFRKVIDTAL AESVIVGAPT TWVLSNHDKQ RHVTRYGDGP EGLRRARAAA
LLMLALPGCA YLYQGEELGL PEVLDLPDDL RQDPQYLRTG QSRDGCRVPI PWNGELAPYG
FGPAGSALSW LPAPATWREL SVRAQTGVSD STLELYRTAL RIRRTHPALS VSSARITWQE
TDPGILAFTR SAAGTALTCV VNISDEPAAV AEYGEPLVAS TALTERDAGY LVPVDAAAWF
ERR