Gene Strop_2001 details

Gene Information

Locus tag: Strop_2001
Symbol:
ID: 5058464
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 2271513
End bp: 2273282
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 67%
IMG OID: 640474267
Product: Alpha-amylase
Protein accession: YP_001158833
Protein GI: 145594536
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
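
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2271513
end_bp = 2273282
gene_length_bp = 1770
protein_length_aa = 589

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last one is assumed to be the stop.
codons, remainder = divmod(span, 3)

print(span)        # 1770
print(remainder)   # 0
print(codons - 1)  # 589
```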


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.357099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.053891
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGACCA CGCTCGGTCC ACACCGCTGG CGTCGGCGTG CAGCCGGCCT CCTCGCCGCC 
GCTCTCGTCA CCGCCATCAC CGCCATCACC GCCACCTCGC TGCCGGCCAA CGTTCAGGCG
TCGCCGCCGG GCGACCGGGA CGTCACCGCC GTCCTGTTCG AGTGGCGCTT CGACTCGATT
GCCCGCGCCT GCCAGGACAC ACTGGGGCCC AAGGGATACG GCTTCGTGCA GGTCTCCCCA
CCGCAGGAGC ACATCCAGGG CTGGCAGTGG TGGACGTCGT ACCAACCCGT CAGCTACGAC
ATCTCCAGCC GGCTGGGTGA CCGGAACGCG TTCCGGGCCA TGACCGAGGC CTGCCACGGC
GCCGGAGTGA AGGTCATCGT GGACGCGGTC ATCAACCACA TGACCGCAGG ATCCGGCACC
GGCACCGGCG GCACTAACTA CAACAAGTAC GACTACCCAG GCTTCTACCA GGTCCAGGAC
TTCCACTCCT GCCGCAAACA CATCAGCGAC TACCGCAACC GCTACGACGT CCAGGAGTGT
GAACTGCTCG GCCTGGCCGA CCTGAACACC GGGTCCGACT ACGTACGGGG GCGTATCGCT
GGCTACCTCA ACGACCTTCT CTCCCTCGGC GCGGACGGCT TCCGCATCGA CGCGGCCAAG
CACATCGCCG CCAGCGACCT GGCGGCGATC CGCTCCCGGA TGAGCAACCC CAACGCCTAC
TGGATCCAGG AAGTGATCTA CGGTGCCGGT GAGGCGGTCC AGCCCAGCGA GTACCTTGGC
ACGGGCGACG TGCAGGAGTT CCGCTACGCG CGGGACCTGA AGCGGGTGTT CCAGAACGAG
AAGCTGGCCT ATCTGCGCAA CTACGGCGAA GGTTGGGGCT ACCTGTCCAG CGGCAAGGCC
GGCGTCTTCG TCAACAACCA CGACACCGAA CGCAACGGCG AGACCCTCTC CTACAAGAAC
GGCTCCGACT ACACGCTCGC CAACGTGTTC ATGCTCGCCT GGCCGTACGG CACGCCGCAC
GTGCACTCCG GCTACGAGTT CAGCGACCGG GACGCCGGCC CACCCAACGG CGGCCACGTC
AACGCCTGCT ACTCCGACGG GTGGACGTGT CAACACGCCT GGCGCCAAAT AGCCAACATG
GTGGGCTTCC GCAACGCCGC CGCCGGGACC GGTGTGACGA ACTGGTGGGA CAACGGCAAC
GACCAGATCG CGTTCGGCCG CGGTGACCGT GCCTTCGTCG CCATCAACCA GGAAGGCGGC
ACCCTCACCC GAACCTTCCA GACGTCACTG CCCGCCGGCA CCTACTGCGA CGTGCAGCAC
GGCGACCCGA CCACGAGCGG TGGATGCACC GGCCCCACCT ACACGGTCAA CTCCTCGGGC
CAGTTCGCCG CGAGCATCGG CCCGGGTGAC GCGGTCGCCC TCTACCGCGG CGCCGCGGGC
AGCCCGACTC CGGACCCGTC CCAGTCCCCG TCGGATCGCG TCAACGTCAC GTTCGCGGTC
ACCGCCACCA CCGTCTGGGG GCAGAACATC TTCGTCGTCG GTGACCACCC TGACCTCGGC
TCATGGAACC CCGACCGCGC CCTGCCGATG AGCGCCGCCA GCTACCCCCA GTGGCGGCTG
ACCACTCCCC TGCCCAGCGG CAGCGCCATC CAGTACAAGT ACATCCGCAA GGAGTCCAAC
GGTCACGTTA CCTGGGAAAG CGGCAACAAC CGGACCGCCA CGATCCCGAA CAGCGGAACA
CTGACCCTGA CCGACAATTG GCGAAACTGA
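
A GC content figure such as the one in the Gene Information fields can be recomputed from a nucleotide sequence. The sketch below is one simple way to do it, assuming GC content means the fraction of G and C bases among all bases; the function name `gc_content` is hypothetical, and the example uses only the first 20 bases of the gene, so its result is not expected to match the whole-gene value.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First two 10-base blocks of the gene sequence (14 of 20 bases are G or C).
fraction = gc_content("GTGCTGACCA CGCTCGGTCC")
print(f"{fraction:.2f}")  # 0.70
```

Passing the full 1770 bp sequence to the same function gives the whole-gene value.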
 
Protein sequence
MLTTLGPHRW RRRAAGLLAA ALVTAITAIT ATSLPANVQA SPPGDRDVTA VLFEWRFDSI 
ARACQDTLGP KGYGFVQVSP PQEHIQGWQW WTSYQPVSYD ISSRLGDRNA FRAMTEACHG
AGVKVIVDAV INHMTAGSGT GTGGTNYNKY DYPGFYQVQD FHSCRKHISD YRNRYDVQEC
ELLGLADLNT GSDYVRGRIA GYLNDLLSLG ADGFRIDAAK HIAASDLAAI RSRMSNPNAY
WIQEVIYGAG EAVQPSEYLG TGDVQEFRYA RDLKRVFQNE KLAYLRNYGE GWGYLSSGKA
GVFVNNHDTE RNGETLSYKN GSDYTLANVF MLAWPYGTPH VHSGYEFSDR DAGPPNGGHV
NACYSDGWTC QHAWRQIANM VGFRNAAAGT GVTNWWDNGN DQIAFGRGDR AFVAINQEGG
TLTRTFQTSL PAGTYCDVQH GDPTTSGGCT GPTYTVNSSG QFAASIGPGD AVALYRGAAG
SPTPDPSQSP SDRVNVTFAV TATTVWGQNI FVVGDHPDLG SWNPDRALPM SAASYPQWRL
TTPLPSGSAI QYKYIRKESN GHVTWESGNN RTATIPNSGT LTLTDNWRN
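
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The Python sketch below illustrates this on the first and last lines of the gene sequence. It assumes that table 11 uses the standard codon assignments and that an alternative start codon such as GTG is read as an initiating methionine; the function name `translate` and its `cds_start` parameter are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str, cds_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, dropping a trailing stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    if cds_start and protein:
        # Assumption: the initiator codon (here GTG) is read as M.
        protein = "M" + protein[1:]
    return protein.rstrip("*")

first_line = "GTGCTGACCA CGCTCGGTCC ACACCGCTGG CGTCGGCGTG CAGCCGGCCT CCTCGCCGCC"
last_line = "CTGACCCTGA CCGACAATTG GCGAAACTGA"

print(translate(first_line))                  # MLTTLGPHRWRRRAAGLLAA
print(translate(last_line, cds_start=False))  # LTLTDNWRN
```

The two outputs match the first 20 and last 9 residues of the protein sequence above. The last line can be translated on its own because it begins at base 1741, which is on a codon boundary.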