Gene Strop_3823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3823 
Symbol 
ID5060301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4379480 
End bp4381105 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content65% 
IMG OID640476081 
Product4-phytase 
Protein accessionYP_001160632 
Protein GI145596335 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGGGA AATTCCTGAA GGTGGCGGTC GCGGCGACCG CCACCGCTAT GTTGGCCACT 
GCCTGCGGTG GCGGCGGCTC GAGCGATGCT GATGGTGACG CCAGTGGCAC GCTTCGTGTA
TATGCGTCAG AGCCGGCGTA CCTGCTGCCG TCGAACGCCG ACGACGAGCC GTCGATCTAC
GTGATCCGTC AGCTCTACCG CGGTCTGGTC AAGTACAACG CCGAGACCAG TGAGTCGGAG
ATGGATCTGG CCGAGTCGAT CACCTCGGAC GACCAGAAGC TCTGGACGAT CAAGCTGAAG
GACGGCTACA CCTTCGACAA CGGTGAGCCG GTCGACGCCG ACTCCTTCAT CCGGTCGTGG
AACTTCGCCG CCTACGCGCC GAACGCCCAG AACAACGGCT ACTTCATGAA GCGGATCGCC
GGTATCGACG AGGTCCTGCC CAAGGACCCG GACGGCGAGG GCCCGGCGGA GGCCCCGGCA
CCGACGGTCG AGACGATGTC CGGCCTGACG AAGGTCGACG ACCTCACCTT CACCGTCGAG
CTCAAGGAGC CCTTCACCGG CTTCCCGACC ATTGTCGGGT ACTCGGGCTT CTTCCCGATG
GCCCAGGCCT GCGTCGACGA CGCGGATGCC TGCAACGAGA CCCCGATCGG TAACGGCCCT
TACAAGATCG ACGGTAGCTG GGAACACGAC GTCGAGATCA ACCTGGTCCG CAGCGAGACC
TGGAAGGGCG AGCCGGGCAA GCCCGAGGCG ATCAACTACC GGATCTTCGC CGACGTGGAC
GGCGCCTACG CCGCCTTCCA GGCCGGCGAG CTGGACGTGA TGTACACGAT CCCGCCGGCG
CGCTTCAAGG ACGCCAAGGC CAGCTACGGC GACCGGCTGT ACGAGCAGGC GGGCGACAGC
CTCAACTACG TCGGCATGCC GCTGTACGAC GACAGCTTCA AGGACAAGCG GATCCGCCAG
GCGATCTCGC TGGCGATCGA CCGGCAGTCC ATCGTTGACG CCGTCTTCGA CGGACGGTGG
ACTCCCGCCA CCGGCTTCGT CGCGCCGATC TTCGAGGGCG CTCGCGAGGG TATCTGCGCC
TACTGCGAGA AGGACGTCGA GAAGGCCAAG GAACTGCTCG CGGCGGCCGG TGGCTGGCCG
GAGGGCAAGA AGCTGACCCT GTGGGCCAAC GCGGGTGCTG GCCACGACGC CTGGCTCCAG
GCCGTCGGCG ACCAGGTCAA GGCCGCGCTG GGCATCGACT ACGAGCTGAA GGTCAACCTG
CAGTTCGCCG AGTACCTGGA CGTGGCGGAC AACCGGGAGT TCACCGGCCC GTTCCGGCTC
GGCTGGGGCC CGGACTACCC GTTCCTGGAG ACCTACCTGA CTCCGCTGTA CAGCACCGGC
AACGACAGCA ACAACAGCAC CTTCAGCAAC CCCGAGTTCG ACAACCTGCT GAAGCAGGGC
GACGCCGCTC CGACCATGGA GGAGGCCATC ACCTTCTACC AGCAGGCTGA GGACATCCTG
GCTGAGGAGA TGCCGGTCAT CCCGATGTTC TGGCGCAAGG AAGCGGCGGT CTACAGCGAG
AACGTGGACG CCTTTGTCTG GAACCAGGTC ATGGGCGCCG ACTACGGTGC GACCTCACTG
AAGTAG
 
Protein sequence
MRGKFLKVAV AATATAMLAT ACGGGGSSDA DGDASGTLRV YASEPAYLLP SNADDEPSIY 
VIRQLYRGLV KYNAETSESE MDLAESITSD DQKLWTIKLK DGYTFDNGEP VDADSFIRSW
NFAAYAPNAQ NNGYFMKRIA GIDEVLPKDP DGEGPAEAPA PTVETMSGLT KVDDLTFTVE
LKEPFTGFPT IVGYSGFFPM AQACVDDADA CNETPIGNGP YKIDGSWEHD VEINLVRSET
WKGEPGKPEA INYRIFADVD GAYAAFQAGE LDVMYTIPPA RFKDAKASYG DRLYEQAGDS
LNYVGMPLYD DSFKDKRIRQ AISLAIDRQS IVDAVFDGRW TPATGFVAPI FEGAREGICA
YCEKDVEKAK ELLAAAGGWP EGKKLTLWAN AGAGHDAWLQ AVGDQVKAAL GIDYELKVNL
QFAEYLDVAD NREFTGPFRL GWGPDYPFLE TYLTPLYSTG NDSNNSTFSN PEFDNLLKQG
DAAPTMEEAI TFYQQAEDIL AEEMPVIPMF WRKEAAVYSE NVDAFVWNQV MGADYGATSL
K