Gene Strop_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3041 
Symbol 
ID5059505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3472079 
End bp3474751 
Gene Length2673 bp 
Protein Length890 aa 
Translation table11 
GC content72% 
IMG OID640475291 
Producttetratricopeptide TPR_4 
Protein accessionYP_001159856 
Protein GI145595559 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAGCG CCGACCAGAT GGACGACAGT CCAGCAGGCA GAGCGCTGCA CCTCGTCACC 
GACGACCCGG CTCGCGCCGC CGAGTTGGCC GATGCCGCAC TCGCCGCTGC CCGTGCCGGT
GCGGATCCGG ACGAGGAGAC CGTGGCGCTG CGGGCGCTCG GTCTCGCCGC GCGGGAGCAA
CACAACGCAG ACCACGGGCT ACGCCATCTG CGCCGGGCAC GCCGCGTGGC CGAGGCGGCC
AAGCTCCCGG CCCGCGCCGC CGAAGCCCGG ATGAGCCTGG CGCTGGTCCT GGCCGAGGCC
GGCCAACCGC AACAGGCGCT GCGGGAGATC GATGCCGCGT CACCGGCACT TCGTGGGCTG
CCCGCCGCGC GCCTACAGAT GCAGCGGGCG CTGATCCTCG ATCGCCTCGG CCGCTCCGAC
GAGGCGATGG CCGGCTACAC CGACGCGCTG GCCGCGTTCC GACGGAGCGG GGACCAGCTG
TGGCAGTCCC GCGCACTCAC CAACAGAGGG GTCCTGCACA CCTACCGGGG TAGCCTGCGC
CAGGCCGAGG CTGACCTGCG GGCGGCGGAG CAGGGCTACG CGGAGCTCGG CCAGGAGCTG
GCTCTCGCCC AGGTCCGCCA CAACCTTGGC TTCGTACTAT CTCGCGCGGG CGACATTCCC
GGCGCACTGC GCTGGTATGA CCTGGCCGAC CAGTACTTCG CCCAGACCAC CCGCCCGGCG
ATCGCCCTGA TGGATCGGGG CGAGCTACTA CTGGGCGCCC GGCTACTGCC GGAGGCTCGT
GCGGCGGCCG AGGCTGCCCT GGACGCGGCC CGGGCAAGCC GAATGGGGCT CTACGAGGCA
CAGGCCCGGC TGATGCTGGC CGAAGTCGCG CTGGCCGAGG TCGATCTGCC GACCGCCTCG
AAGCAGGCGC AGGCCGCGTA CCGCCTGTTC AGTCGACAGA ACCGACCGGC GTGGGCAGCG
CTCTCCCGAT ACGTCAAGCT ACGGGCCAGC TCGCCGGACG AACCGGCGCA CCTGCGGCAG
GCACGCTCGG TGGCACGTGC ACTGGCAACG AGTAGCTGGC CGATACCCGC ACTCGACGCG
CGTCTCTACG CCGCCCGAGT CGCCCTGGAC CGGGCGCTTC GTACCGGGCA ACGCGCCAAG
GCCGCAGCCA TCGCCGGCGA GTTGAGTGCC ATTCGAGCCG CCAGCCGAGG CGGACCGGCC
CAGCTCCGGG CCCGCGCCTG GCATGCCGAG GCACTCCGAC GAATCGCCAC TGGCGATATC
GCCGGGGCGC GCCGGGCCCT CAATGCGGGA ATGGCCCTGC TGGACCGCTA CCAGGCAGCC
CTCGGCGCCA CAGAGCTGCG GGTGATGGCC GGGGCATACG CCCTGGACCT GGCCAGCACC
GGCCTTCGAC TCGCGGTTCG TGGCGGTCAG GCTCGGGCGA TCCTACGCTG GAGTGAGCGT
TGGCGGGCCG CCGCGCTGCG GTTGCCCCCC GCCCGACCGC CGGACGAAGC CGGGCTTGCT
GCCGACCTGG CCGAGTTGCG CCGCACCGTG GACGAATCCG CCCAGGCGAG TGCCGCGTTG
GGGTTAACCC TGTTGCGTCG ACAGCGGACG ATCGAGGAAC GAATCCGCAC CCGTAGTTGG
CAGGCCAGCG GCACCGAGTC GGGTTCGGCG AGCGGGATAT CGCTGGACCG GATCGCGATG
GAACTGGCTG ACCAGACGCT GGTCGAGCTG GTCGACATTG ACCGGACGCT ACACGCCGTC
GTGGTGCACG GCGGCCGGTT TCACACCCGG GCGCTCGGTG CGCTCGCCGC CGTCACGGAC
GAGCTCCAAG CCCTGCGTTT TGCGCTACGA CGCATCCTGA CCAGCCGCGG CACCGACGAC
TCCCGGGCGG CGGCGGCGAC CGCGGCCCGA TTCGCGGTAC ACCAGCTGGA CGACATGATC
TTCGGTGCGA TACAGCCCTG GCTCGGCGCT GGCGGACTGG TCCTGGTACC GGTCGGCGCA
CTGCACGCCA TGCCCTGGGC GCTACTCCCG ACCTGCGCCG GGCGACCGGT CACGGTGGTG
CCATCGGCCT CCGAGTGGCT CACTGCATCG GCCCGCCGCC AGGCCTCCCA CCCCACGATG
ACCACTCCGC CCCACCAGCG GAACCCGGTG CTGGTAGCCG GGCCCGGCCT CGCCTACGCC
GAAGCGGAGG TGCAGCGGTT GGCGGGGGCA CTCGCTCCCG TCCAGGTGCT CCTCGGCTCC
GACGCGACCG CCGAGGCAGC GCTCGCGGCG CTGGATGGTG CTCCCCTTGC CCACCTCGCC
GCACACGGCA CGTTCCGCAC CGACAATCCC ATGTTCTCGC ACGTACGCCT CGCCGACGGC
CCCCTGACCG TCTACGACCT GGAACGACTC GCCTGCGCGC CTGGCACGGT GGTGCTCTCC
GCCTGCGACG TGGGGTTGTC AGCGGTGCAT CCGGGCGAGG AGTTGATGGG ACTGTCGGCC
GCGCTGCTTC AGCTGGGAAC GGCGACGGTG CTGGCCAGTG TCCTGCCGGC ACTGGACTCC
GCAGCGCAGG AGCTGATGGT GGAGATGCAT CGACGGCTGG CCACCGGCGA CACACCGGGG
CTCGCCCTCG CCGGCGCACA GGCGGAGTTC GGTACTGGCC TGGACGCCGG CACCGCAACC
GCCGCCTCGT TTGTCTGCTT CGGCGCCGGC TGA
 
Protein sequence
MGSADQMDDS PAGRALHLVT DDPARAAELA DAALAAARAG ADPDEETVAL RALGLAAREQ 
HNADHGLRHL RRARRVAEAA KLPARAAEAR MSLALVLAEA GQPQQALREI DAASPALRGL
PAARLQMQRA LILDRLGRSD EAMAGYTDAL AAFRRSGDQL WQSRALTNRG VLHTYRGSLR
QAEADLRAAE QGYAELGQEL ALAQVRHNLG FVLSRAGDIP GALRWYDLAD QYFAQTTRPA
IALMDRGELL LGARLLPEAR AAAEAALDAA RASRMGLYEA QARLMLAEVA LAEVDLPTAS
KQAQAAYRLF SRQNRPAWAA LSRYVKLRAS SPDEPAHLRQ ARSVARALAT SSWPIPALDA
RLYAARVALD RALRTGQRAK AAAIAGELSA IRAASRGGPA QLRARAWHAE ALRRIATGDI
AGARRALNAG MALLDRYQAA LGATELRVMA GAYALDLAST GLRLAVRGGQ ARAILRWSER
WRAAALRLPP ARPPDEAGLA ADLAELRRTV DESAQASAAL GLTLLRRQRT IEERIRTRSW
QASGTESGSA SGISLDRIAM ELADQTLVEL VDIDRTLHAV VVHGGRFHTR ALGALAAVTD
ELQALRFALR RILTSRGTDD SRAAAATAAR FAVHQLDDMI FGAIQPWLGA GGLVLVPVGA
LHAMPWALLP TCAGRPVTVV PSASEWLTAS ARRQASHPTM TTPPHQRNPV LVAGPGLAYA
EAEVQRLAGA LAPVQVLLGS DATAEAALAA LDGAPLAHLA AHGTFRTDNP MFSHVRLADG
PLTVYDLERL ACAPGTVVLS ACDVGLSAVH PGEELMGLSA ALLQLGTATV LASVLPALDS
AAQELMVEMH RRLATGDTPG LALAGAQAEF GTGLDAGTAT AASFVCFGAG