Gene Strop_0601 details

Gene Information

Locus tag: Strop_0601
Symbol:
ID: 5057042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 680912
End bp: 682120
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 66%
IMG OID: 640472871
Product: HEAT repeat-containing PBS lyase
Protein accession: YP_001157459
Protein GI: 145593162
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
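
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 680912
end_bp = 682120
gene_length_bp = 1209
protein_length_aa = 402


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, both comparisons agree with the values reported in the record (1209 bp and 402 aa).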


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACTCC AATCCATCGG TCTGTCCACC GGGGGCCGGG ACCGCGCGCC AGAGTGGGAG 
CACTACCTCA AGGGTGACCC GGCGCTGTTG AACAGCGAGC TCGGCGACAT CCTGGACCGG
GATGCGGTGC GGCTCGACAT CCGCACCGAC GACCAGGTCG CGTACGAGGC GCTGCACCAC
CCGGACCCGA TGGTCCGCGA GCAGTCCCTG TACCAGGCGA TGGACCGCCG GTTGCCGGAA
GCGATCGACC TGATCGCGGA AAGCATCGCC ACCGATGAAA ACCGCGAGGT GCGGTGGAAC
GCACTGTGGG CGCTGGAAAA GATCGGCGGG CCGCGGGCCC TACAGATCAT CGAACGGCAC
GTCAACGACG ACGACGCCGA CGTCGGCGAG TGGGCACAAC TGTTTAGCTC CGAGCTGCGC
ACCGGCCTGC CTGCCTTCGA CAACCGGTCG TTCGCCTGGG ACAGCGACCG GACCTTCGAC
GAGACGATTC TGCTCAACAT CCACTGTGAC GTCTACGTTG CACTGGATGA GACGGGACGC
AACTGGGGGA AGATCTCCCT GGCGCCCCAG GGCTTGGCCC GCAGCTACGG TCAGGCGCAC
GCGTGCCCGA ACACGGACAC CCGTAACCAG AAGCTCATCA TCAGCAAGAC ACTGTCCGGC
CTGCATGAGG ACGGAACGCC GCACACGGAG AACTTCGTGT TCCGGGGGCT CACAAACCAC
GCCAACGCCG GCCGCGGCAG CTTCTACTTC GAGTCACGCG GTCTGCGGCC GATCTTCCTA
TCCGGCCGCG CCGACGACGA CAGCCTGGGA CACCGCAACG AGATGGTCGC CGCCAAGCGC
AGTGGCGAGT GGACCCTCGA CCCGAGGATC CAGATCAGGG GCGAGTCGGC GATCCGCTAC
GTCCGGGGTC GGGTGCACAC CTGGGGCTAC GTCAACTTCG ACACCATGGC GGGCAGCTCG
CTGGAGGAGG TGCTGTTCCC CGGCAACAGC ATCCTCGGCA CGCTGGACAC CCCCACCGGG
CCGCTGGCGA ACGCGTTCAT CGTGGGCACG TTCAAGGGCA AGCTGGTCGA CTGGGATGGC
GACGACAAGG TCAATGTCAA CTCGCTCGAC ATCTACTCGA CGCTGGACGG GGACGTCGAC
TCCGACCAGG ACGGCGTCGC CGACATCCCG GGGGTGCAGT TCTGCCCCCG TACCAACTGG
ATGAACTGA
 
Protein sequence
MTLQSIGLST GGRDRAPEWE HYLKGDPALL NSELGDILDR DAVRLDIRTD DQVAYEALHH 
PDPMVREQSL YQAMDRRLPE AIDLIAESIA TDENREVRWN ALWALEKIGG PRALQIIERH
VNDDDADVGE WAQLFSSELR TGLPAFDNRS FAWDSDRTFD ETILLNIHCD VYVALDETGR
NWGKISLAPQ GLARSYGQAH ACPNTDTRNQ KLIISKTLSG LHEDGTPHTE NFVFRGLTNH
ANAGRGSFYF ESRGLRPIFL SGRADDDSLG HRNEMVAAKR SGEWTLDPRI QIRGESAIRY
VRGRVHTWGY VNFDTMAGSS LEEVLFPGNS ILGTLDTPTG PLANAFIVGT FKGKLVDWDG
DDKVNVNSLD IYSTLDGDVD SDQDGVADIP GVQFCPRTNW MN
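
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a minimal example, not the database's own method: the `translate` and `gc_fraction` helpers are hypothetical, only the first 30 bases and the last 9 bases of the gene sequence above are used, and the codon table is the standard one, whose codon-to-residue assignments translation table 11 shares (table 11 additionally permits alternative start codons, which are not handled here).

```python
from itertools import product

# Standard codon-to-residue assignments, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 9 bases of the gene sequence in this record.
first_30 = "ATGACACTCC AATCCATCGG TCTGTCCACC"
last_9 = "ATGAACTGA"

print(translate(first_30))  # MTLQSIGLST
print(translate(last_9))    # MN
```

The two outputs match the first ten and last two residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give a value comparable to the GC content field.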